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TITLE: Oil Bodies and Associated Proteins as Affinity Matrices 

FIELD OF THE INVENTION 

This invention relates to the use of oil bodies and their 
associated proteins as affinity matrices for the separation and purification 
5 of target molecules from samples. 

BACKGROUND OF THE INVENTION 

Within the general field of biotechnology, the ability to 
effectively separate and purify molecules from complex sources, such as 
living cells, blood serum, or fermentation broth, is of critical importance. 
0 Applications in industry and research laboratories (where, for example, 
purified or partly purified proteins are used) are numerous and well 
documented in prior literature. See, for example, R. Meadon and G. 
Walsh in Biotechnological Advances 1994, 12: pp 635-646. 

The majority of currently employed techniques for the 
5 separation of molecules capitalizes on the innate physical and chemical 
properties of the molecule of interest. Affinity-based purification 
technologies are unique in that they exploit the highly specific biological 
recognition between two molecular species which form an affinity pair. 
Binding of the two entities of the affinity pair occurs in almost all 
3 instances as a result of relatively weak chemical interactions, known as 
non-covalent bonds. Some art-recognized and commonly used affinity 
pairs include antibodies and their binding antigenic substances, nucleic 
acid binding proteins and nucleic acids, lipid binding proteins and lipids, 
lectins and carbohydrates, streptavidin/biotin complexes, protein 
> A/immunoglobulin C complexes, and receptors and their binding 
molecules. 

In general, affinity-based purification processes require that 
one member of the affinity pair is immobilized on a solid substrate or 
matrix that is insoluble in the fluid in which the other member of the 
) pair resides. The molecular species of the affinity pair bound to the 
matrix is generally referred to as the ligand, while the liquid soluble 
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member is generally referred to as the target member. However, it is 
important to note that these definitions do not impose any restrictions in 
a strict chemical sense. The vast majority of current ligand 
immobilization techniques rely on physical or chemical approaches. 
Physical ligand immobilization involves adsorption or entrapment of 
the ligand to a suitable support, while the chemical mode of 
immobilization is characterized by the formation of strong crosslinks or 
covalent attachments between the ligand and the matrix. It is a 
requirement that immobilization is accomplished in such a fashion that 
the capacity of the members of the affinity pair to recognize each other is 
not adversely affected by the immobilization procedure. 

It is a disadvantage of the currently available physical and 
chemical techniques for immobilizing ligands that production processes 
are frequently time consuming and expensive. This is mainly due to the 
fact that immobilization techniques require the separate production of 
matrix material and ligands, which in a subsequent step must be coupled. 
An alternative mode of immobilizing proteins is described in U.S. Patent 
No. 5,474,925 which documents a biological production system for the 
immobilization of enzymes in the fibre of cotton plants. This patent 
discloses what is believed to be the first biologically produced enzyme 
immobilization system and allows a one step production of matrix and 
ligand. 

Subsequent to immobilization of the ligand on the matrix, a 
variety of affinity based purification techniques may be employed to 
accomplish selective binding between the affinity immobilized ligand 
and the target member. Affinity based purification techniques known in 
the prior art include perfusion affinity chromatography, affinity 
repulsion chromatography, hyperdiffusion affinity chromatography, 
affinity precipitation, membrane affinity partitioning, affinity cross-flow 
ultrafiltration and affinity precipitation. In the most widely used affinity 
based purification technique, affinity chromatography, a matrix 
containing a ligand is coated to, or packed on, the inside of a 
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chromatographic column. A complex mixture containing the target 
member is then applied to the chromatographic column. Ideally, only 
the target molecules that specifically recognize the ligand bind in a non- 
covalent fashion to the chromatographic column, while all other 
5 molecular species present in the sample pass through the column. 

In affinity partitioning, two solutions of substantially 
different densities are employed. The complex solution containing the 
target member is mixed with a solution of a different density containing 
the affinity ligand. Subsequent to mixing, the solutions are left to settle 
in order to permit the formation of two separate phases. Molecules tend 
to partition differentially between phases depending on their size, charge 
and specific interactions with the phase-forming solutions. Ligand- 
bound target protein selectively partitions to the phase containing the 
affinity ligand. For example, Coughlin and Baclaski in Biotechnology 
Progress, 1990 6: 307-309 reported the use of the biotin containing organic 
solution isooctane to transfer avidin from an aqueous solution to the 
isooctane solution. However, so far applications of affinity partitioning 
have been limited mainly due to the current lack of availability of 
suitable affinity matrix substances which can be employed in specific 
20 partitioning in two phase systems. 

An important factor for the commercial development of 
biotechnology is the purification of byproducts, which typically accounts 
for 50% or more of the total costs (Labrou, N. and Clonis, Y. D. in the 
Journal of Biotechnology 36: 95-119 (1994)). Many protein purification 
steps rely on column type separation procedures. In particular, large scale 
high-separation techniques such as column chromatography or batch- 
type based protein purification techniques are costly. In addition, crude 
material is less suitable for either column chromatography or batch 
separations, as contaminants may foul up sedimented resins and plug 
columns. Thus, affinity matrices are often only employed in a later stage 
of purification processes where substantial purity is critical, where the 
proteins are present in extremely dilute concentrations, or where high 
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value proteins are required, for example in diagnostic and therapeutic 
proteins. These and other topics related to the use of affinity technology 
in biotechnological processes have been reviewed by Labrou, N. and 
Clonis, Y. D. in the Journal of Biotechnology 36: 95-119 (1994). 
» There is a need in the art to develop novel and economical 

methods for separating and purifying biological products from complex 
mixtures. The present inventors have found that subcellular oil storage 
structures, known as oil bodies, and their associated proteins are useful 
in this regard. 
10 SUMMARY OF THF TNTVENTTOM 

The present invention relates to a novel versatile biological 
system for the production of affinity matrices. The present inventors 
have found that oil bodies and their associated proteins can be used as 
affinity matrices for the separation of a wide variety of target molecules 
such as proteins, carbohydrates, lipids, organic molecules, nucleic acids, 
metals, cells and cell fractions from a sample. 

In accordance with the invention, there is provided a 
method for the separation of a target molecule from a sample 
comprising: 1) contacting (i) oil bodies that can associate, either directly or 
indirectly, with the target molecule with (ii) a sample containing the 
target molecule; and 2) separating the oil bodies associated with the target 
molecule from the sample. The oil bodies and the sample containing the 
target molecule are brought into contact in a manner sufficient to allow 
the oil bodies to associate with the target. Preferably, oil bodies are mixed 
with the target. If desired, the target molecule may be further separated 
from the oil bodies. 

In one aspect, the target molecule has affinity for, or binds 
directly to, the oil bodies or oil body protein. Examples of such targets 
include antibodies or other proteins that bind to oil bodies. 

In another aspect, a ligand molecule may be used to 
associate the target molecule with the oil bodies. 
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In one embodiment, the ligand has natural affinity for the 
oil bodies or oil body protein. In a specific embodiment the ligand is an 
antibody that binds the oil body protein. Such an antibody can be used to 
separate targets having affinity for the ligand antibody such as anti-IgG 
antibodies or protein A. A bivalent antibody may also be prepared 
having binding specificities for both the oil body protein and the target. 
The antibody against the oil body protein may also be fused to a second 
ligand having affinity for the target. 

In another embodiment, the ligand is covalently attached to 
the oil bodies or oil body protein. In one embodiment, the ligand is a 
protein that is chemically conjugated or produced as a fusion protein 
with the oil body protein (as described in WO 96/21029). In the latter 
case, the fusion protein is targeted to and expressed on the oil bodies. In 
one example, the ligand fused to the oil body protein may be hirudin and • 
can be used to purify thrombin. In another example, the ligand fused to 
the oil body protein may be metallothionein and can be used to separate 
cadmium from a sample. In a further example, the ligand fused to the 
oil body protein may be protein A and can be used to separate 
immunoglobulins. In yet another example, the ligand fused to the oil 
body protein may be cellulose binding protein and can be used to separate 
cellulose from a sample. 

In another embodiment, the ligand may be covalently 
attached to the oil bodies. For example, the ligand may be a small organic 
molecule such as biotin. Biotinylated oil bodies can be used to separate 
25 avidin from a sample. 

The present invention also includes modified oil bodies for 
use as an affinity matrix. Accordingly, the present invention includes a 
composition comprising oil bodies associated with a molecule, such as a 
ligand molecule or a target molecule. In one embodiment, the 
composition comprises oij bodies covalently attached to a ligand 
molecule, such as biotin. 
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The present invention also includes an affinity matrix for 
use in separating a target molecule from a sample, comprising oil bodies 
that can associate with the target molecule. The affinity matrix may 
additionally include a ligand molecule associated with the oil bodies, 
wherein the ligand molecule is capable of associating with the target 
molecule. 

Other objects, features and advantages of the present 
invention will become apparent from the following detailed description 
and attached drawings. It should be understood, however, that the 
detailed description and associated examples are given by way of 
illustration only, and various changes and modifications thereto falling 
within the scope of the invention will become apparent to those skilled 
in the art. In addition, reference is made herein to various publications, 
patents and patent applications which are hereby incorporated by 
15 reference in their entirety. 

BRIEF DESC RIPTION OF THE DRAWlMr.g 

Figure 1. The nucleotide and deduced amino acid sequence 
of the 18 KDa oleosin from Arabidopsis thaliana as shown in 
SEQ.ID.NOrl and SEQ.ID.NO:2. 

Figure 2. Sequence of an Arabidopsis oleosin-hirudin 
fusion. Indicated are a portion of the oleosin genomic sequence (from 
base 1-1620 as reported in van Rooijen et al 1992, Plant Mol. Biol. 18: 
1177-1179), a spacer sequence (base 1621-1635, underlined) and the 
synthetic DNA sequence encoding the mature hirudin variant-2 isoform 
(base 1636-1833, italicized) This gene fusion is regulated by the 5' 
upstream region of the Arabidopsis oleosin (bases 1-861) and the 
nopaline synthase termination sequence (base 1855-2109). The sequence 
is also shown in SEQ.ID.NO:3 and SEQ.ID.NO:4. 

Figure 3. Outline of the steps employed in the construction 
of pCGOBHIRT, containing the entire oleosin-hirudin construct. 
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Figure 4. Schematic diagram illustrating the configuration 
of the oleosin-hirudin fusion protein on the oil body and the binding of 
thrombin. 

Figure 5. NaCl elution profiles of thrombin from wild type 
and 4A4 oil body matrices transformed with a construct expressing an 
oleosin-hirudin fusion. 

Figure 6. Purification of a horseradish peroxidase 
conjugated anti-IgG antibody using an anti-oleosin antibody as a ligand. 
Schematic diagram illustrating the configuration of the oleosin / anti- 
oleosin / anti-IgG sandwich complex bound to an oil body. 

Figure 7. Illustrates specific binding of anti-IgG antibodies to 
wild type oil bodies complexed with primary anti-oleosin antibodies as a 
ligand (left) and binding of anti-IgG antibodies to oil bodies which were 
not complexed with primary antibodies prior to binding with the 
15 secondary antibodies (right). 

Figure 8. Sequence of an oleosin metallothionein fusion. 
Indicated are the coding sequence of a B. napus oleosin cDNA (bases 1092- 
1652, van Rooijen, 1993, Ph.D. Thesis, University of Calgary), a spacer 
sequence (bases 1653-1670, underlined) and the human metallothionein 
gene mt-II (bases 1671-1876, Varshney and Gedamu, 1984, Gene, 31: 135- 
145)). The gene fusion is regulated by an Arabidopsis oleosin promoter 
(bases 1-1072) and ubiquitin termination sequence (bases 1870-2361, ubi3' ; 
Kawalleck et al, 1993, Plant Mot. Biol. 21: 673-684). The sequence is also 
shown in SEQ.ID.NO:6 and SEQ.ID.NO:7. 

Figure 9. Outline of the steps employed in the construction 
of pBIOOM3' containing the entire oleosin-metallothionein construct. 

Figure 10. Schematic diagram illustrating the configuration 
of the oleosin-metallothionein fusion protein on the oil body and 
binding of cadmium ions. 

Figure 11. Illustrates the binding (A) and elution (B) of 
cadmium to an oil body matrix from wildtype B. carinata seeds and B. 
carinata seeds transformed with a construct expressing oleosin 
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metallothionein gene fusion. Shown is the percentage cadmium bound 
to the oil body fraction of an oil body fraction harvested from transgenic 
and untransformed seeds. Bars represent average values of 5 replicate 
experiments (binding) and 3 replicates (elution). 
5 Figure 12. Illustrates the binding of protein A expressing S. 

aureus cells to oil bodies treated with varying amounts of anti-oleosin 
IgGs. Bars represent OD 600 readings obtained following the procedures as 
described in Example 5 and using varying amounts of IgGs (0 ul, 3, |j.l, 30 
ul, 100 ul of added IgG). 

Figure 13. Oligonucleotide primers used to amplify the 
sequence of the S. aureus protein A (The sequence is also shown in 
SEQ.ID.NO:8; The protein sequence is also shown in SEQ.ID.NO:9). 
Primer BK266, 5'C TCC ATG GAT CAA CHC A AT GGT TTA TP 3 ' 
(SEQ.ID.NOrlO), a Ncol site (italicized) and a sequence identical to a 
portion of the protein A gene as contained within vector pRITZ2T 
(Pharmacia) (underlined) are indicated. Primer BK267, 5' GC AAG CTT 
CTA ATT TGT TAT CTG CAG GTC 3' (SEQ.ID.NO:!!), a Hindlll site 
(italicized), a stop codon (bold) and a sequence complementary to a 
portion of the protein A gene as contained within pRIT2T (Pharmacia) 
(underlined) are indicated. The PC R product was digested with Ncol and 
Hindlll and ligated into pCGNOBPGUSA (Van Rooijen and Moloney, 
1995, Plant Physiol. 109: 1353-1361) from which the Ncol-GVS-Hindlll 
fragment had been removed. 

Figure 14. Sequence of an Arabidopsis oleosin-protein A 
fusion (The sequence is also shown in SEQ.ID.NO:12 and the protein 
sequence is also shown in SEQ.ID.NO:13 and 14). Indicated are a portion 
of the oleosin genomic sequence (from base 1 - 1626, as reported in van 
Rooijen et al, 1992 Plant Mol. Biol. 18: 1177-1179), a spacer sequence 
encoding a thrombin cleavage site (base 1627 - 1647, underlined) and the 
DNA sequence encoding protein A (base 1648 - 2437, italicized). 
Expression is regulated by the Arabidopsis 5' upstream region of the 
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Arabidopsis oleosin (base 1 - 867) and the nopaline synthase terminator 
region (base 2437 - 2700). 

Figure 15. Schematic diagram illustrating the configuration 
of the oleosin-protein A fusion protein on the oil body and binding of 
5 the immunoglobulin. 

Figure 16. A western blot illustrating the binding of HRP- 
conjugated mouse anti-rabbit antibodies to oil body protein extracts 
obtained from transgenic B. napus lines expressing oleosin-protein A 
fusion proteins. Shown on a Western blot probed with an HRP- 
conjugated antibody are oil body protein extracts from transgenic lines, 
opa 30 (lane 3), opa 31 (lane 4), opa 34 (lane 5), opa 36 (lane 6), opa 47 (lane 
7), opa 93 (lane 8), all expressing an oleosin-protein A fusion protein and 
a control untransformed B. napus line (lane 9), as well as lysates of E. coli 
DH5a transformed with pRIT2T expressing protein A (lane 2) and 
15 untransformed E. coli DH5oc (lane 1). 

Figure 17 illustrates binding and elution of IgGs to oil bodies 
isolated from wildtype B. napus (bn wt) and a transgenic B. napus line, 
expressing an oleosin protein A fusions. Error bars represent the results 
from 4 independent experiments. 
20 DET AILED DESCRIPTION OFTHF INVENTION 

As hereinbefore mentioned, the present invention relates to 
a novel biological affinity matrix system that employs oil bodies and their 
associated proteins. The affinity matrix is suitable for the highly-efficient 
separation of specific targets, including proteins, carbohydrates, lipids, 
nucleic acids, cells and subcellular organelles, metals and ions, from a 
sample. 

The present invention provides a method for the separation 
of a target molecule from a sample comprising: 1) contacting (i) oil bodies 
that can associate either directly or indirectly with the target molecule 
with (ii) a sample containing the target molecule; and 2) separating the 
oil bodies associated with the target molecule from the sample. The oil 
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bodies and the sample containing the target molecule are brought into 
contact in a manner sufficient to allow the oil bodies to associate with the 
target. Preferably, the oil bodies are mixed with the target. Indirect 
association of the oil bodies with the target can be effected using a ligand 
5 molecule that can associate with both the oil bodies and the target 
molecule. The ligand therefore serves to bridge or join the oil bodies 
with the target molecule. If desired, the target molecule may be further 
separated from the oil bodies and the ligand, if present. 

Each of the components of the affinity matrix are discussed 
10 in turn below. 
Targ ets 

The term "target" as used herein denotes a desired molecule 
that one wants to purify, isolate or separate from a sample such as a 
biological mixture. This technology is amenable for use with virtually 
any target for which a ligand can be obtained or any target that can 
directly associate with or bind to an oil body or oil body protein. Possible 
ligand/target pairs include but are not limited to: protein 
subunit/subunit associations, antibodies/antigens, receptor 
protein/signal molecules, nucleic acid binding proteins/nucleic acids; 
lectins/carbohydrates; lipid binding proteins/lipids; ion binding 
proteins/ions; and ligands to surface epitopes /cells or subcellular 
organelles. The target may be obtained from any natural source or may 
be synthesized chemically. If the target is a macromolecule such as a 
protein or nucleic acid it may also be produced in recombinant form 
using any suitable expression system such as bacteria, yeast, plant, insect, 
mammalian, etc. 
Li gands 

The term "ligand" used herein denotes a molecule that is 
capable of associating with both the target molecule and the oil bodies or 
oil body protein (discussed below). The term "associating with" as used 
herein includes both covalent and non-covalent binding of the ligand to 
the oil bodies or the target molecule. For example, the ligand molecule 
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may be covalently attached to the oil bodies and non-covalently associate 
with the target (and vice-versa), or the ligand may non-covalently 
associate with both the oil bodies and the target molecule. The ligand 
may be any molecule that can bridge the oil bodies or oil body protein 
and the target molecule and can include a protein, nucleic acid, 
carbohydrate or small organic molecule. The ligand may be comprised of 
two molecules, a first molecule that associates with the oil bodies and a 
second molecule that associates with the target, wherein the first 
molecule and the second molecule associate with each other. 

The affinity ligand proteins used for this methodology may 
be derived from naturally-occurring, known ligand pairs such as those 
listed above. Alternatively, the ligand may be obtained by screening 
proteins extracted from cells or organisms, synthesized chemically or 
produced in libraries comprised of combinatorial peptide sequences, - 
15 antibodies, or expressed DNA sequences. 

In one embodiment, the ligand has natural affinity for the 
oil bodies or the oil body protein. For example, the ligand may be a 
protein such as an antibody, that has affinity for the oil body protein. The 
ligand may also be a molecule other than a protein which has natural 
affinity for the oil body or oil body protein. Such ligands, capable of 
binding to the oil bodies or oil body protein, may be associated with a 
second molecule that can bind the target molecule. For example, the 
ligand molecule may be an antibody conjugated to avidin and can be 
used to purify biotin from a sample. 

In another embodiment, the ligand is covalently linked to 
the oil bodies or oil body protein by chemical or recombinant means. 
Chemical means for preparing fusions or conjugates are known in the art 
and can be used to prepare a ligand-oil body protein fusion. The method 
used to conjugate the ligand and oil body must be capable of joining the 
ligand with the oil body protein without interfering with the ability of 
the ligand to bind to the target molecule. In one example, the ligand may 
be a small organic molecule such as biotin that is covalently attached to 
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the oil bodies. Biotinylated oil bodies can be used to separate avidin from 
a sample. The present invention also includes modified oil bodies such 
as biotinylated oil bodies for use as an affinity matrix. Accordingly, the 
present invention includes a composition comprising oil bodies attached 
5 to a molecule, such as a ligand or a target molecule. 

In a preferred embodiment, the ligand is a protein and can 
be conjugated to the oil body protein using techniques well known in the 
art. There are several hundred crosslinkers available that can conjugate 
two proteins. (See for example "Chemistry of Protein Conjugation and 
Crosslinking". 1991, Shans Wong, CRC Press, Ann Arbor). The 
crosslinker is generally chosen based on the reactive functional groups 
available or inserted on the ligand. In addition, if there are no reactive 
groups a photoactivatible crosslinker can be used. In certain instances, it 
may be desirable to include a spacer between the ligand and the oil-body 
15 protein. Crosslinking agents known to the art include the 
homobifunctional agents: glutaraldehyde, dimethyladipimidate and 
Bis(diazobenzidine) and the heterobifunctional agents: m- 
Maleimidobenzoyl-N-Hydroxysuccinimide and Sulfo-m- 
Maleimidobenzoyl-N-Hydroxysuccinimide. 

A ligand protein-oil body protein fusion may also be 
prepared using recombinant DNA techniques. In such a case a DNA 
sequence encoding the ligand is fused to a DNA sequence encoding the 
oil body protein, resulting in a chimeric DNA molecule that expresses a 
ligand-oil body protein fusion protein (discussed in greater detail below). 
In order to prepare a recombinant fusion protein, the sequence of the 
DNA encoding the ligand must be known or be obtainable. By obtainable 
it is meant that a DNA sequence sufficient to encode the protein ligand 
may be deduced from the known amino acid sequence. It is not necessary 
that the entire gene sequence of the ligand be used provided that a 
subsequence encoding the binding domain of the protein ligand is 
known. Therefore, the ligand can include the complete sequence of, or 
the binding domain from, the specific ligand protein in question. 
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If the DNA sequence of the desired ligand is known, the 
gene may be synthesized chemically using an oligonucleotide 
synthesizer. Alternatively, the clone carrying the ligand gene may be 
obtained from either cDNA or genomic libraries containing the gene by 
5 probing with a labelled complementary DNA sequence. The gene may 
also be specifically amplified from the library using gene-specific 
oligonucleotide primers and the PCR. If the DNA sequence of the 
desired ligand is not known, then a partial amino acid sequence may be 
obtained through N-terminal sequencing of the protein (Matsudaira 
1987; /. Biol. Chem. 262:10035-10038). Labelled probes may be synthesized 
based upon the DNA sequences deduced from this amino acid sequence 
and used to screen cDNA or genomic libraries as described above. The 
clone carrying the gene may also be identified from a cDNA expression 
library by probing either with antibodies raised against the protein ligand, " 
15 or with the target protein. 

Ligands may also be uncovered by probing mixtures of 
proteins with the target. The target can be immobilized on a support 
matrix and used to screen proteins extracted from cells and tissues or 
synthesized chemically. Following binding between the ligand protein " 
and the immobilized target, the matrix is separated from the solution 
and washed. The protein ligand is subsequently eluted from the matrix 
and the sequence determined as described above. Alternatively, 
recombinant protein libraries produced by phage display, such as those 
comprised of combinatorial peptide sequences (Smith, 1985; Science 228: 
1315-1317) or antibody repertoires (Griffiths et nl, 1994, EMBO J. 13: 3245- 
3260, Nissim et al, 1994, EMBO J. 13: 692-698) can be screened with the 
immobilized target. In this case, binding between the protein ligand and 
the target would enable separation and recovery of the phage expressing 
the ligand from the large, complex population of phage encoding non- 
binding proteins. A two-hybrid system such as that in yeast (Fields and 
Sternglanz, 1994; Trends Genet. 10: 286-292) might also be used to identify 
a ligand from an expressed cDNA library. Here, a gene fusion is 
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constructed between the sequence encoding the target protein and that of 
a DNA binding protein. Cells containing this construct are transformed 
with constructs from a cDNA library where the sequences have been 
fused to that of a transcriptional activator. Binding between ligands 
derived from the cDNA library with the target protein allows 
transcription of a reporter gene to occur. Clones expressing the ligand are 
then recovered. 

To specifically uncover a ligand to oil bodies, a complete or 
partial oleosin protein may be used as target in any of the above methods. 
Alternatively, it may be possible to employ intact oil bodies for screening 
protein extracts, synthetic peptides or phage display libraries. In this case, 
the oil body would serve both as target and immobilization matrix. 
Using this approach, a wider variety of ligands may be uncovered; that 
exhibit affinity not only to oleosins, but to other epitopes present on oil 
15 bodies. 

Oil bodips an d Oil Bndv Proteins 

Oil bodies are small, spherical, subcellular organelles 
encapsulating stored triacylglycerides, an energy reserve used by many 
plants. Although they are found in most plants and in different tissues, 
they are particularly abundant in the seeds of oilseeds where they range 
in size from under one micron to a few microns in diameter. Oil bodies 
are comprised of the triacylglycerides surrounded by a half-unit 
membrane of phospholipids and embedded with a unique type of protein 
known as an oil body protein. The term "oil body" or "oil bodies" as 
used herein includes any or all of the triacylglyceride, phospholipid or 
protein components present in the complete structure. The term "oil 
body protein" as used herein means a protein that is naturally present in 
an oil body. In plants, the predominant oil body proteins are termed 
"oleosins". Oleosins have been cloned and sequenced from many plant 
sources including corn, rapeseed, carrot and cotton. The oleosin protein 
appears to be comprised of three domains; the two ends of the protein, N- 
and C-termini, are largely hydrophilic and reside on the surface of the oil 
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body exposed to the cytosol while the highly hydrophobic central core of 
the oleosin is firmly anchored within the membrane and triacylglyceride. 
Oleosins from different species represent a small family of proteins 
showing considerable amino acid sequence conservation, particularly in 
5 the central region of protein. Within an individual species, a small 
number of different isoforms may exist. 

Oil bodies from individual species exhibit a roughly 
uniform size and density which is dependent in part upon the precise 
protein/ phospholipid /triacylglyceride composition. As a result, they 
may be simply and rapidly separated from liquids of different densities in 
which they are suspended. For example, in aqueous media where the 
density is greater than that of the oil bodies, they will float under the 
influence of gravity or applied centrifugal force. In 95% ethanol where 
the density is less than that of the oil bodies, they will sediment under"' 
the same conditions. Oil bodies may also be separated from liquids and 
other solids present in solutions or suspensions by methods that- 
fractionate on the basis of size. For example, the oil bodies from B. napus 
are minimal, approximately 0.5^m in diameter, and thus may be 
separated from smaller components using a membrane filter with a pore 
size less than this diameter. 

The oil bodies of the subject invention are preferably 
obtained from a seed plant and more preferably from the group of plant 
species comprising: thale cress (Arabidopsis thaliana), rapeseed {Brassica 
spp.), soybean {Glycine max), sunflower (Helianthus anniius), oil palm 
(Elaeis guineeis), cottonseed (Gossypium spp.), groundnut (Arachis 
hypogaea), coconut (Cocus nucifera), castor (Ricinus communis), 
safflower (Carthamus tinctorius), mustard (Brassica spp. and Sinapis 
alba), coriander (Coriandrum sativum) linseed/flax (Lin urn 
usitatissimum), and maize (Zea mays). Plants are grown and allowed to 
set seed using agricultural cultivation practises well known to a person 
skilled in the art. After harvesting the seed and removal of foreign 
material such as stones or seed hulls, for by example sieving, seeds are 
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preferably dried and subsequently processed by mechanical pressing, 
grinding or crushing. The oil body fraction may be obtained from the 
crushed seed fraction by capitalization on separation techniques which 
exploit differences in density between the oil body fraction and the 
aqueous fraction, such as centrifugation, or using size exclusion-based 
separation techniques, such as membrane filtration, or a combination of 
both of these. Typically, seeds are thoroughly ground in five volumes of 
a cold aqueous buffer. A wide variety of buffer compositions may be 
employed, provided that they do not contain high concentrations of 
strong organic solvents such as acetone or diethyl ether, as these solvents 
may disrupt the oil bodies. The solution density of the grinding buffer 
may be increased with the addition of 0.4-0.6 M sucrose, in order to 
facilitate washing as described below. The grinding buffer will also 
typically contain 0.5 M NaCl to help remove soluble proteins that are not 
15 integrally bound to the oil body surface. 

Following grinding, the homogenate is centrifuged 
resulting in a pellet of particulate and insoluble matter, an aqueous phase 
containing soluble components of the seed, and a surface layer comprised 
of oil bodies with their associated proteins. The oil body layer is 
skimmed from the surface and thoroughly resuspended in one volume 
of fresh grinding buffer. It is important that aggregates of oil bodies are 
dissociated as thoroughly as possible in order to ensure efficient removal 
of contaminants in the subsequent washing steps. The resuspended oil 
body preparation is layered under a floatation solution of lower density 
(e.g. water, aqueous buffer) and centrifuged, again, separating oil body 
and aqueous phases. This washing procedure is typically repeated at least 
three times, after which the oil bodies are deemed to be sufficiently free 
of contaminating soluble proteins as determined by gel electrophoresis. 
It is not necessary to remove all of the aqueous phase and to the final 
preparation water or 50 mM Tris-HCl pH 7.5 may be added and if so 
desired the pH may be lowered to pH 2 or raised to pH 10. Protocols for 
isolating oil bodies from oil seeds are available in Murphy, D. J. and 
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Cuirimins I., 1989, Phytochemistry, 28: 2063-2069; and in: Jacks, T. J. et al., 
1990, JAOCS, 67: 353-361. A preferred protocol is detailed in example 1 of 
the present specification. 

Oil bodies other than those derived from plants may also be 
? used in the present invention. A system functionally equivalent to plant 
oil bodies and oleosins has been described in bacteria (Pieper-Furst et al., 
1994, /. Bacterial 176: 4328), algae (Rossler, P.G., 1988, /. Physiol. (London), 
24: 394-400) and fungi (Ting, J. T. et al., 1997, /. Biol Chem. 272: 3699-3706). 
Oil bodies from these organisms, as well as those that may be discovered 
in other living cells by a person skilled in the art, may also be employed 
according to the subject invention. 
Affinity Matrices 

As hereinbefore mentioned, the present invention provides 
a novel affinity matrix system for the purification of a target molecule 
from a sample. In one embodiment, the affinity matrix comprises oil 
bodies that can bind a target molecule in a sample. In such an 
embodiment, the target molecule may be an antibody that can bind an oil 
body protein. In another embodiment, the affinity matrix comprises oil 
bodies or oil body proteins and a ligand that is associated with the oil 
bodies or oil body proteins and has affinity for a target molecule. In such 
an embodiment, the ligand may be non-covalently or covalently attached 
to the oil bodies or oil body protein (as described above). 

It is an advantage of the present invention that target 
substances can be purified or removed from samples through non- 
covalent association with oil bodies followed by oil body separation. A 
number of different oil body-ligand configurations are possible. Targets 
with inherent affinity for a specific ligand proteins such as hirudin to 
thrombin or heavy metals to metallothionein, may be purified or 
separated with oil bodies containing that ligand fused to an oleosin. 
Alternatively, a protein target may also be purified or separated with an 
oil body affinity matrix by fusing the target to an oil body-specific ligand 
or to a ligand complimentary to that fused to an oleosin. If desired, a 
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protease recognition site or chemical cleavage site may be engineered 
between the ligand and the target protein to enable proteolytic removal 
of the ligand from the target protein in the course of purification. A 
multivalent ligand may also be constructed, such as a bivalent single- 
chain antibody, in which one domain of the ligand has an affinity for an 
oil body and the other domain(s) exhibits affinity for the target. In this 
case, neither the oil body nor the target molecule need to be covalently 
fused to a ligand. Also, concatamers of ligands may be used to increase 
the affinity of a matrix for a target, or the sequence of a ligand may be 
mutated to modulate the affinity for a target when such conditions are 
desirable. Further, mixtures of different ligands may be fused to 
recover /remove different types of targets simultaneously. Fusions 
between different ligands may also be constructed to form bridges 
between different types of targets or between targets and the oil body 
affinity matrix. Binding to the affinity matrix may also be achieved by 
forming bridges between ligand or ligand and target sequences, such as 
Zn ++ ions bridging between polyhistidine sequences. 

There are several advantages associated with the use of oil 
body affinity matrices that make them attractive as purification tools. 
The flexibility in design that is possible through the different 
configurations described above, enables a matrix to be constructed to best 
meet the requirements for a specific target. Also, production of the 
matrix as part of a natural biological process in seeds is extremely cost- 
effective, since purification and immobilization of the ligand are not 
necessary. In the case of ■ oleosin-ligand fusions, the ligand is 
immobilized on the oil body as a result of oleosin targeting within the 
cell, while oil body-specific ligands will naturally associate with the 
matrix while present in complex mixtures. Natural immobilization of 
the ligand on the matrix may also be advantageous in that it eliminates 
the requirement for chemical cross-linking that may compromise the 
affinity of the ligand for the target. Finally, oil body affinity matrices offer 
a unique and attractive purification option particularly for large scale 
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operations. The ability to separate the matrix through floatation as a 
loose suspension enables it to be employed with crude material 
containing what might otherwise be prohibitive amounts of particulate 
contaminants. The presence of these contaminants will often foul and 
5 block conventional solid matrices applied in columns or batch 
suspensions limiting their use at early stages in the purification process. 

As mentioned previously, in one embodiment of the 
invention, ligand protein sequences are genetically fused to the oil body 
protein. In order to prepare such genetic fusions, a chimeric DNA 
sequence is prepared that encodes an oil body protein-ligand fusion 
protein and consists of (a) a DNA sequence encoding a sufficient portion 
of an oil body protein to provide targeting of the fusion protein to the oil 
bodies and (b) a DNA sequence encoding a sufficient portion of the ligand 
protein to provide binding of the target. The inventors have determined : ' 
that, in general, the N-terminus and the hydrophobic core of an oil body " 
protein are sufficient to provide targeting of the fusion protein to the oil " 
bodies. In particular, for oleosins derived from the plant Arabidopsis 
thaliana amino acids 2 through 123 (as shown in SEQ.ID.NO:!) are 
sufficient in this regard. 

The ligand may be fused to either the N- and /or C- terminal 
end of the oleosin. It may also be possible to construct an internal fusion 
between the ligand and oleosin or to fuse the ligand between two oleosin 
proteins. The chimeric DNA sequence encoding an oil body protein 
fused to a ligand may be transfected into a suitable vector and used to 
transform a plant. Two types of vectors are routinely employed. The 
first type of vector is used for the genetic-engineering and assembly of 
constructs and typically consists of a backbone such as found in the pUC 
family of vectors, enabling replication in easily-manipulated and 
maintained gram negative bacteria such as £. coli. The second type of 
vector typified by the Ti and Ri plasmids, specify DNA transfer functions 
and are used when it is desired that the constructs be introduced into the 
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plant and stably integrated into its genome via Agrobacterium-mediated 
transformation. 

A typical construct consists, in the 5' to 3' direction, of a 
regulatory region complete with a promoter capable of directing 
expression in plants (preferably seed-specific expression), a protein coding 
region, and a sequence containing a transcriptional termination signal 
functional in plants. The sequences comprising the construct may be 
either natural or synthetic or any combination thereof. 

Both non-seed specific promoters, such as the 35-S CaMV 
promoter (Rothstein et al, 1987; Gene 53: 153-161) and seed-specific 
promoters such as the phaseolin promoter (Sengupta-Gopalan et al., 
1985; PNAS USA 82: 3320-3324) or the Arabidopsis 18 kDa oleosin (Van 
Rooijen et al., 1992; Plant Mol. Biol. 18: 1177-1179) promoters may be 
used. In addition to the promoter, the regulatory region contains a 
ribosome binding site enabling translation of the transcripts in plants and 
may also contain one or more enhancer sequences, such as the AMV 
leader (Jobling and Gehrke 1987; Nature 325: 622-625), to increase the 
expression of product. 

The coding region of the construct will typically be 
comprised of sequences encoding a ligand fused in frame to an oleosin 
and ending with a translational termination codon. The sequence for the 
oleosin may be comprised of any DNA sequence, or part thereof, natural 
or synthetic, sufficient to encode a protein that can be correctly targeted 
to, and stably expressed on, an oil body. A detailed description of the 
characteristics of such a sequence has been reported previously in 
Moloney, 1993; PCT Patent Appl. WO 93/21320 which is hereby 
incorporated by reference. The sequence may also include introns. The 
ligand-encoding region may in turn be comprised of any individual, or 
combination of, ligand sequences identified as described above. If 
desired, a protease or chemical recognition site may be engineered 
between the ligand and the target protein to enable proteolytic removal 
of the ligand from the target protein in the course of purification. 
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The region containing the transcriptional termination 
signal may comprise any such sequence functional in plants such as the 
nopaline synthase termination sequence and additionally may include 
enhancer sequences to increase the expression of product. 

The various components of the construct are ligated 
together using conventional methods, typically into a pUC-based vector. 
This construct may then be introduced into an Agrobacterium vector and 
subsequently into host plants, using one of the transformation 
procedures outlined below. 

A variety of techniques are available for the introduction of 
DNA into host cells. For example, the chimeric DNA constructs may be 
introduced into host cells obtained from dicotyledonous plants, such as 
tobacco, and oleaginous species, such as B. napus using standard 
Agrobacterium vectors; by a transformation protocol such as that ■ 
described by Moloney et al, 1989, {Plant Cell Rep., 8: 238-242) or Hinchee ' 
et al, 1988, (Bio/TechnoL, 6: 915-922); or other techniques known to those .< 
skilled in the art. For example, the use of T-DNA for transformation of 
plant cells has received extensive study and is amply described in EPA 
Serial No. 120,516; Hoekema et al., 1985, (Chapter V, In: The Binary Plant 
Vector System Offset-drukkerij Kanters B.V., Alblasserdam); Knauf, et al, 
1983, (Genetic Analysis of Host Range Expression by Agrobacterium, p. 
245, In Molecular Genetics of the Bacteria-Plant Interaction, Puhler, A. 
ed., Springer-Verlag, NY); and An et al, 1985, (EMBO /., 4: 277-284). 
Conveniently, explants may be cultivated with A. tumefaciens or A. 
rhizogenes to allow for transfer of the transcription construct to the plant 
cells. Following transformation using Agrobacterium the plant cells are 
dispersed in an appropriate medium for selection, subsequently callus, 
shoots and eventually plantlets are recovered. The Agrobacterium host 
will harbour a plasmid comprising the vir genes necessary for transfer of 
the T-DNA to the plant cells. For injection and electroporation, (see 
below) disarmed Ti-plasmids (lacking the tumour genes, particularly the 
T-DNA region) may be introduced into the plant cell. 



V 



WO 98/27115 



-22- 



PCT/CA97/00951 



10 



15 



The use of non-Agrobacterium techniques permits the use 
of the constructs described herein to obtain transformation and 
expression in a wide variety of monocotyledonous and dicotyledonous 
plants and other organisms. These techniques are especially useful for 
species that are intractable in an Agrobacterium transformation system. 
Other techniques for gene transfer include biolistics (Sanford, 1988, 
Trends in Biotech., 6: 299-302), electroporation (Fromm et al., 1985, Proc. 
Natl. Acad. Sci. USA, 82: 5824-5828; Riggs and Bates, 1986, Proc. Natl. 
Acad. Sci. USA 83: 5602-5606) or PEG-mediated DNA uptake (Potrykus et 
al., 1985, Mol. Gen. Genet., 199: 169-177). 

In a specific application, such as to B. napus, the host cells 
targeted to receive recombinant DNA constructs typically will be derived 
from cotyledonary petioles as described by Moloney et al., (1989, Plant Cell 
Rep., 8: 238-242). Other examples using commercial oil seeds include 
cotyledon transformation in soybean explants (Hinchee et al., 1988. 
Bio/Technology, 6: 915-922) and' stem transformation of cotton (Umbeck 
et al., 1981, Bio/Technology, 5: 263-266). 

Following transformation, the cells, for example as leaf 
discs, are grown in selective medium. Once shoots begin to emerge, they 
are excised and placed onto rooting medium. After sufficient roots have 
formed, the plants are transferred to soil. Putative transformed plants 
are then tested for presence of a marker. Southern blotting is performed 
on genomic DNA using an appropriate probe, for example an A. thaliana 
oleosin gene, to show that integration of the desired sequences into the 
2i> host cell genome has occurred. 

The expression cassette will normally be joined to a marker 
for selection in plant cells. Conveniently, the marker may be resistance 
to a herbicide, e.g. phosphinothricin or glyphosate, or more particularly 
an antibiotic, such as kanamycin, G418, bleomycin, hygromycin, 
chloramphenicol, or the like. The particular marker employed will be 
one which will allow for selection of transformed cells compared with 
cells lacking the introduced recombinant DNA. 



20 



30 



WO 98/27115 



-23- 



PCT/CA97/00951 



10 



15 



20 



0 



The fusion peptide in the expression cassette constructed as 
described above, expresses at least preferentially in developing seeds. 
Accordingly, transformed plants grown in accordance with conventional 
ways, are allowed to set seed. See, for example, McCormick et al. (1986, 
Plant Cell Reports, 5: 81-84). Northern blotting can be carried out using 
an appropriate gene probe with RNA isolated from tissue in which 
transcription is expected to occur, such as a seed embryo. The size of the 
transcripts can then be compared with the predicted size for the fusion 
protein transcript. 

Oil body proteins are then isolated from the seed and 
analyses performed to determine that the fusion peptide has been 
expressed. Analyses can be for example by SDS-PAGE. The fusion 
peptide can be detected using an antibody to the oleosin portion of the 
fusion peptide. The size of the fusion peptide obtained can then be * 
compared with predicted size of the fusion protein. 

Two or more generations of transgenic plants may be grown • 
and either crossed or selfed to allow identification of plants and strains 
with desired phenotypic characteristics including production of 
recombinant proteins. It may be desirable to ensure homozygosity of the 
plants, strains or lines producing recombinant proteins to assure 
continued inheritance of the recombinant trait. Methods of selecting 
homozygous plants are well know to those skilled in the art of plant 
breeding and include recurrent selfing and selection and anther and 
microspore culture. Homozygous plants may also be obtained by 
transformation of haploid cells or tissues followed by regeneration of 
haploid plantlets subsequently converted to diploid plants by any 
number of known means, (e.g.: treatment with colchicine or other 
microtubule disrupting agents). 

Method of Separating Target Molpn.lp* U sing th( , Aff inity M.fri... 

As hereinbefore mentioned, the present invention relates to 
a method of separating a target molecule from a sample using the above 
described oil body proteins and in some cases, ligands. In the method of 
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the invention, oil bodies are mixed with a sample containing the desired 
target and the interaction between the ligand and target results in the 
non-covalent association of the target with the oil body. Following 
centrifugation, the oil bodies and affinity-bound target are separated from 
the aqueous phase, effectively purifying the target from any 
contaminants present in the original sample. Repeating the washing 
step ensures that any remaining contaminants are removed. 

Following their attachment to oil bodies, targets may be 
eluted under conditions determined empirically for each individual 
ligand-target pair. Treatment of the bound matrix with the appropriate 
eluent and centrifugation enables recovery of the purified target in the 
aqueous phase. If the target is a ligand-protein fusion containing a 
protease recognition site, then it may be treated with the appropriate 
protease to remove the ligand. The free ligand may then be separated 
from the target protein by re-application of the oil body affinity matrix or 
through conventional protein purification methods. 

The chemical and physical properties of the affinity matrix 
may be varied in at least two ways. Firstly, different plant species contain 
oil bodies with different oil compositions. For example, coconut is rich 
in lauric oils (C12), while erucic acid oils (C22) are abundantly present in 
some Brassica spp. Furthermore, proteins associated with the oil bodies 
will vary between species. Secondly, the relative amounts of oils may be 
modified within a particular plant species by applying breeding and 
genetic engineering techniques or a combination of these known to the 
skilled artisan. These techniques aim at altering the relative activities of 
enzymes controlling the metabolic pathways involved in oil synthesis. 
Through the application of these techniques, seeds with a sophisticated 
set of different oils are obtainable. For example, breeding efforts have 
resulted in the development of a rapeseed with a low erucic acid content 
(Canola) (Bestor, T. H., 1994, Dev. Genet. 15: 458) and plant lines with oils 
with alterations in the position and number of double bonds, variation 
in fatty acid chain length and the introduction of desirable functional 
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groups have all been generated through genetic engineering (Topfer et 
al., 1995, Science, 268: 681-685). Using similar approaches a person skilled 
in the art will be able to further expand on the presently available sources 
of oil bodies. Variant oil compositions will result in variant physical and 
chemical properties of the oil body fraction. Thus by selecting oilseeds or 
mixtures thereof from different species or plant lines as a source for oil 
bodies, a broad repertoire of oil body matrices with different textures and 
viscosities may be acquired. 
Applications of Oil Bndy Affinity Matrices 

Given that it is possible to engineer oil body affinity 
matrices for several classes of proteins, multiple uses for oil body based 
affinity matrices are envisioned. Bacteria, fungi, plants and animals all 
contain proteins which are able to specifically interact with agents such as 
ions, metals, nucleic acids, sugars, lipids and other proteins. These agents^ 
15 may be immobilized using oil body technology. 

The oil body protein affinity matrices can be used to isolate- 
any target molecule that can bind to the oil body protein, either directly 
or indirectly through a ligand molecule. Examples of target molecules 
that may be isolated from a sample using the methodology of the present 
invention include proteins, peptides, organic molecules, lipids, 
carbohydrates, nucleic acids, cells, cell fragments, viruses and metals. In 
particular, the inventors have shown that the affinity matrix of the 
present invention can be used to separate therapeutic proteins (such as 
thrombin), antibodies, metals (such as cadmium), carbohydrates (such as 
25 cellulose), organic molecules (such as biotin) and cells (such as bacterial 
cells). 

Oil body affinity matrices may also be used to separate cells 
of industrial or medical interest from a mixed population of cells. For 
example haematopoietic stem cells, which are a subpopulation of blood 
cells and are used in bone marrow transplantations and in stem cell gene 
therapies, may be separated from other blood cells using oil body based 
affinity technology. In recombinant DNA technology it is often required 



20 



30 



10 



15 



WO 98/27115 PCT/CA97/00951 

-26- 

that cells in which recombinant DNA has been successfully introduced, 
known as transformed cells, are distinguished and separated from cells 
which failed to acquire recombinant DNA. Provided that part of the 
recombinant DNA expresses a cell surface protein which is 
complementary to a oil body based affinity ligand, it is possible to utilize 
oil bodies to separate transformed cells from untransformed cells. Oil 
body affinity technology may also be used to separate cellular organelles 
such as chloroplasts and mitochondria from other cellular material. 
Viral particles may also be separated from complex mixtures. 

It is also possible to immobilize a class of proteins known as 
metalloproteins, which contain prosthetic groups that specifically bind 
ions. Examples of metalloproteins are haemoglobin, which binds iron, 
parvalbumbin which binds calcium and metallothionein a protein 
which binds zinc and other metal ions. It is envisioned that oil bodies 
could be used to scavenge metals from streams of flowing material, 
which might be water contaminated with the waste of metals from 
laboratories and industrial processes. Example 4 given below further 
illustrates this application. Other examples where proteins may be 
bioimmobilized and employed in a bioremediation strategy include the 
removal of phosphates, nitrates and phenols from waste streams. In part 
this approach may overcome the real or perceived limitations of bacterial 
bioremediation. In certain instances it may not be practical or necessary 
to rely on affinity partitioning technology to separate the oil body matrix 
from the target compound. In these instances, it is envisioned that oil 
bodies may be immobilized on a solid inert surface which could be a flat 
surface or the surface of a column. A solution containing the affinity 
ligand may then be passed over the surface coated with immobilized oil 
bodies whereupon selective affinity binding occurs. It is envisioned that 
immobilized oil bodies may be used in pipes and in ponds to assist in 
30 bioremediation. 

The following examples illustrate various systems in which 
oil bodies can be used as affinity matrices. It is understood that the 
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examples given below are intended to be illustrative rather than 
limiting. 

EXAMPLES 

EXAMPLF 1 
> Purification of Thrombin 

The following example demonstrates the utility of an oil 
body affinity matrix for the purification of thrombin. Thrombin is a 
serine protease which plays a central role in blood coagulation. It cleaves 
fibrinogen to produce fibrin monomers which polymerize to form the 
basis of a blood clot (Fenton 1981; Ann. N.Y. Acad. Sci. 370: 468-495). 
Alfa-thrombin consists of two polypeptide chains of 36 (A-chain) and 259 
(B-chain) residues linked by a disulphide bridge. Degen et al. 1983; 
Biochemistry 22: 2087-2097). Hirudin, which is found in the salivary 
glands of the medicinal leech Hirudo medicinalis, is a very specific and 
15 potent inhibitor of thrombin. This inhibition is a result of the 
non-covalent binding of hirudin to specific parts of the alfa-thrombin 
chain. (Stone and Hofsteenge 1986; Biochemistry 25: 4622-4628). 

The immobilized ligand is comprised of an isoform of 
hirudin fused to the 18 kDa Arabidopsis oleosin (oil body protein) (Van" 
Rooijen et al, 1992; Plant Mol. Biol. 18: 1177-1179). Expression of the 
construct is regulated by the Arabidopsis 18 kDa oleosin promoter (Van 
Rooijen et al, 1994; Plant Mol. Biol. 18: 1177-1179). The sequence of the 
oleosin-hirudin fusion is shown in Figure 2 and in SEQ.ID.NO:3. 
Oleosin-Hir udin Cnnstmrt 

Oligonucleotide primers were designed based upon the 
reported sequence for a Brassica napus oleosin gene (Murphy et al. 1991, 
Biochim. Biophys. Acta 1088: 86-94) and used to amplify a fragment from 
B. napus genomic DNA through PCR. Using this fragment as a probe, a 
clone carrying a 15 kbp insert was identified and isolated from a EMBL3 
Arabidopsis genomic library. Oligonucleotide primers were used to 
amplify a fragment from this insert containing the entire oleosin coding 
sequence and intron together with 840 basepairs of the 5' upstream 
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region. The primers were designed so as to eliminate the translational 
stop codon and to introduce a PstI restriction endonuclease recognition 
site at the 5' end and a Sail followed by a Pvul site at the 3' end of the 
fragment. The fragment was end-filled and ligated into the Smal site of 
the plasmid vector P UC19. A Sail - EcoRI fragment from plasmid P BI121 
(Clontech) comprising the nopaline synthetase terminator sequence was 
then inserted to generate pOBILT. 

A synthetic hirudin variant 2 (HV2) sequence was 
synthesized based upon reported sequence information (Harvey et al. 
1986, Proc. Natl. Acad. Sci. USA 83: 1084-1088) but employing B. napus 
and Arabidopsis codon usage. The sequence was amplified using four 
overlapping oligonucleotide primers designed such that the resulting 
fragment possessed Pvul and Sail sites at the 5' and 3' ends respectively. 
This fragment was ligated into the Smal site of the pUC19 plasmid vector 
to generate pHIR. The Pvul - Sail fragment from pHIR was then inserted 
into pUCOBILT between the oleosin and terminator sequences to form 
an in-frame fusion with the oleosin coding region giving pUCOBHIRT. 
The entire construct was subcloned into pBluescript KS+ (pBIOBHIRT) 
and then into the PstI site of P CGN1559 plasmid (McBride and 
Summerfelt, 1990, Plant Mol. Biol. 14: 269-276) carrying a neomycin 
phosphotransferase gene under control of the 35-S CaMV promoter 
(pCGOBHIRT). This plasmid was introduced into Agrobacterium 
tumefaciens. The preparation of this plasmid is shown in Figure 3. 
Transf ormation and Regeneration 

Procedures for the transformation of Agrobacterium and 
plants have been described previously. Agrobacterium tumefaciens was 
transformed with the above construct through electroporation (Dower et 
al., 1988; Nucl. Acids Res. 16: 6127-6145). The transformed bacteria were 
then used to transform cotyledonary explants of Brassica napus, 
followed by plant regeneration according to the methods of Moloney et 
al. (1989; Plant Cell Reports 8: 238-242). Transgenic plant were initially 
identified using a neomycin phosphotransferase assay and subsequently 
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confirmed by expression of the oleosin-hirudin fusion as determined 
through northern and immunoblot analysis. 
Preparation of Oil Bodies 

Seed from either control (non-transgenic) plants or 
> transgenic plants expressing the oleosin-hirudin fusion were 
homogenized in five volumes of cold grinding buffer (50 mM Tris-HCl, 
pH 7.5, 0.4 M sucrose and 0.5 M NaCl) using a polytron operating at 
high-speed. The homogenate was centrifuged at approximately 10 x g for 
30 min. to remove particulate matter and to separate oil bodies from the 
aqueous phase containing the bulk of soluble seed protein. Oil bodies 
were skimmed from the surface of the supernatant with a metal spatula 
and placed in one volume of fresh grinding buffer. To achieve efficient 
washing in subsequent steps, it was important to ensure that the oil 
bodies were thoroughly redispersed. This was accomplished by gently 
re-homogenising the oil bodies in grinding buffer with the polytron 
operating at low-speed. Using a syringe, the resuspended oil bodies were ; 
carefully layered underneath five volumes of cold 50 mM Tris-HCl, pH 
7.5 and centrifuged as above. Following centrifugation, the oil bodies , 
were again removed and the washing procedure repeated three times to 
remove residual contaminating soluble seed proteins. The final washed ' 
oil body preparation was resuspended in one volume of cold 50 mM 
Tris-HCl pH 7.5, redispersed with the polytron, and was then ready for 
use as an affinity matrix. 
Affinity Purificati on of Thrnmhin 

The purification of thrombin using the oleosin-hirudin 
fusion protein is shown schematically in Figure 4. In order to evaluate 
the binding of thrombin, affinity matrices were prepared from transgenic 
Brassica napus seeds expressing the oleosin-hirudin fusion protein (4A4 
seeds) (Parmenter et al. Plant Molecular Biology (1995) 29: 1167-1180) 
and from wild type Brassica napus cv Westar seeds. Binding of 
thrombin to both matrices was evaluated. Procedures for the preparation 
of washed oil bodies from seeds were the same as those described above. 
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Solutions containing a range of thrombin activities between 0 and 1 units 
were mixed with 10 ul of a fixed amount of affinity matrix (prepared 
from a total of 10 mg of dried seeds; corresponding to approximately 100 
ug of total oil body protein) in 500 ul binding buffer (50 mM Tris-HCl (pH 
7.5); 0.1% (w/v) BSA). The oil body suspension was then incubated for 30 
minutes on ice and centrifuged at 14,000 rpm for 15 minutes at 4°C. The 
buffer under the oil bodies ( termed 'unternatant') containing the 
unbound, free thrombin was recovered using an hypodermic needle and 
assayed for thrombin activity as follows. A total of 250 ul of unternatant 
was added to 700 ul binding buffer and prewarmed to 37°C. Following 
the addition of 50 u 1 of 1 mM thrombin substrate 
N-p-tosyl-gly-pro-arg-p-nitroanilide (Sigma) to the unternatant, the 
change in optical density at 405 nanometers was monitored 
spectrophotometrically for 3 minutes. The concentration of thrombin in 
the assay mixture was determined employing a standard curve which 
was constructed using a set of thrombin samples containing known 
concentrations of thrombin. The values obtained from these assays were 
used to calculate the concentration bound thrombin assuming: 

[bound thrombin] = [total thrombin] - [free thrombin] 
The ratio of the concentration of bound over the 
concentration of free thrombin was plotted as a function of the 
concentration of bound thrombin (Scatchard plot). From these plots the 
dissociation constants of the affinity matrix were calculated following 
standard procedures (Scatchard, G. Ann. N.Y. Acad. Sci. (1949) 57: 
660-672) and assuming: K a = 1/K d . The dissociation constants of the 
affinity matrices were 3.22 x lO-'m for wild type and 2.60 x 10-8 m f or 4A4 
oil bodies. 

In order to evaluate the recovery of bound thrombin from 
the matrices a NaCl gradient was employed. The elution profile of 
thrombin bound to oleosin-hirudin oil body matrices was compared with 
the profile from thrombin bound to wildtype oil body matrices. 
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Procedures for preparation of wild type oil bodies from wild type Brassica 
napus cv Westar seeds and for the preparation of oleosin-hirudin oil 
bodies from. Brassica napus 4A4 seeds (Parmenter et al. Plant Molecular 
Biology (1995) 29: 1167-1180) were identical to those described above. 
5 Procedures for binding of thrombin to the matrices were as described 
above, except 100 M l aliquots of oil bodies were used to bind 0.5 units of 
thrombin. Oil body suspensions were left on ice for 30 minutes prior to 
centrifugation for 15 minutes at 4°C and 14,000 rpm. The unternatant 
was assayed for (unbound) thrombin activity. The oil body matrix was 
then resuspended in binding buffer to which NaCl was added to a final 
concentration of 0.05 M. Starting with the 30 minutes incubation of the 
oil body suspension on ice, the procedure was repeated five times 
increasing the NaCl concentration in a stepwise fashion. The final NaCl 
concentrations used were 0.05 M, 0.1 M, 0.2 M, 0.3 M, 0.4 M and 0.6 M. 
The NaCl concentrations in the thrombin assay were kept constant at 150 
mM. Figure 5 shows the elution profiles obtained when wildtype oil ' 
bodies and 4A4 oil bodies were used. 
EXAMPT F 7 

Use of Antibodies as Bivalent Ligands 

Antibodies may be used as bivalent ligands by virtue of their 
affinity both for specific epitopes and for other antibodies or proteins (for 
example the Staphylococcus aureus protein A) which have affinity for 
immunoglobulins (IgGs). In this example, polyclonal anti-oleosin 
antibodies serve as a bivalent ligand and antibodies raised in rabbits 
against the anti-oleosin antibodies serve as the target. This example is 
illustrated schematically in Figure 6. 

Oil bodies were prepared from 5 g of wild type Brassica 
napus cv Westar seeds following the procedure described in Example 1. 
Subsequently, oil bodies were washed twice with 100 mM glycine (pH 
2.5), neutralized through two washes in binding buffer (50 mM Tris-HCl, 
pH 7.5) and resuspended in 5 ml of binding buffer. A 150 ul aliquot of the 



WO 98/27115 



-32- 



PCT/CA97/00951 



10 



15 



washed oil body preparation was combined with 500 ul of rabbit serum 
containing anti-oleosin antibodies (ligand antibodies), diluted 1:10 with 
binding buffer. The oil body suspension was mixed thoroughly and 
incubated for 1 h at 4°C with agitation. Following incubation, unbound 
ligand antibodies were removed from the oil body suspension through 
three washes with 1 ml of binding buffer. Oil bodies were then combined 
with 500 ul of serum diluted 1:500 in binding buffer and containing 
anti-rabbit IgG antibodies (the target antibodies) conjugated with 
horseradish peroxidase (HRP) as a detection label (Sigma). This 
suspension was mixed and incubated under conditions identical to those 
used for the anti-oleosin antibody binding. As a control, target antibodies 
were incubated with oil bodies which had not been Previously bound to 
ligand antibodies. Both samples were subsequently washed four times 
with 1 ml of binding buffer to remove unbound antibodies. Using 
binding buffer, the samples were equalized with respect to concentration 
of oil bodies as determined by measuring sample turbidity 
spectrophotometrically at 600 nm. To assay for bound target antibody, 
samples containing 5 ul of oil bodies were mixed with 1 ml of the HRP 
colorimetric substrate tetramethylbenzidine in 0.01% hydrogen peroxide 
and reacted for 10 minutes at room temperature. Reactions were stopped 
by the addition of 500 ul of 1 M H 2 S0 4 and the absorbance at 450 nm was 
determined. Corrections for the presence of residual, unbound target 
antibody remaining after washing were made by assaying 5 ul of the final 
wash fraction. The results obtained for control and ligand bound oil body 
25 preparations are set forth in Figure 7. 
EXAMPT F i 

Use of Oleosin-Specific Ligands 

The use of an oleosin-specific ligand represents an 
alternative to the use of an antibody or genetically-engineered oleosin 
fusion proteins for the purification of recombinant target proteins. In 
this case, the target protein is fused to the oleosin-specific ligand and the 



20 



30 



10 



15 



20 



25 



30 



WO 98/27115 PCT/CA97/00951 

- 33 - 

endogenous oleosins present on the oil bodies of non-transgenic seeds 
serve as the complementary ligand-affinity matrix. In addition to 
eliminating the requirement for a transgenic line expressing an oleosin 
fusion, this approach increases the overall capacity of the affinity matrix, 
since all of the endogenous oleosins may now participate in binding. 

Oleosin-specific ligands may be identified and isolated from 
a peptide phage display library screened with oleosin protein. Since the 
extreme hydrophobicity of the oleosin central domain can result in 
aggregation and precipitation of the protein when removed from oil 
bodies, a mutant protein lacking this domain may be used for screening. 
This has little effect on the efficacy of the ligand, as only the hydrophillic 
portions of the oleosin are exposed to the cytoplasm (i.e. the N- and 
C-termini). Hence, these are the only regions available for binding to a 
Hgand. Once isolated, the ligand may be fused to a common reporter 
protein, green fluorescent protein (GFP) (Prasher, 1995, Trends Genet. 
11:320-323), to demonstrate purification. 
Removal nf fho Oleosin Central Dnmain 

Oligonucleotide primers specific for the Arabidopsis oleosin 
gene described above can be used to amplify an oleosin gene from a B. 
napus cDNA library (van Rooijen 1993, Ph.D. Thesis, University of 
Calgary). Primers flanking sequences encoding the N-terminal 62 amino 
acids and the C-terminal 55 amino acids, may be used to amplify 
sequences for the respective N- and C-terminal oleosin domains in 
separate reactions. Additionally, the primer for the 5' end of the 
N-terminal domain contains a sequence for a thrombin recognition site 
to enable cleavage of the fusion protein as described below. The resulting 
fragment was ligated into the Smal site of the bacterial expression vector 
pEZZ 18 (Pharmacia). This vector contains sequences encoding a signal 
peptide for protein secretion into the periplasm, and synthetic IgG 
binding domains derived from protein A to facilitate protein 
purification, downstream of the multiple cloning site. 
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Expression and Purification of tho Ol e osin Option Construct 

The vector carrying the deletion mutant construct is 
introduced into E. coli using standard methods and transformants 
selected. A culture of the transformed bacteria can be induced to express 
; the synthetic protein A-mutant oleosin fusion protein by addition of 1 
mM IPTG. Induced cells may be pelleted and resuspended in 5 mM 
MgS0 4 causing lysis of the periplasmic membrane through osmotic 
shock. The lysed cells are centrifuged and the supernatant containing the 
secreted protein is loaded on to a column containing IgG-coupled 
sepharose. After washing to remove unbound protein, the column is 
loaded with a buffer containing 50 mM Tris-HCl, pH 7.5, 150 mM NaCl 
and l.OU/ml of purified Bovine thrombin (Sigma) to cleave the mutant 
oleosin from the synthetic protein A. Following incubation at 37°C for 
4h, the column is drained and the eluate passed through a column of 
heparin-coupled sepharose to remove thrombin. The eluate from this 
column, containing the mutant oleosin protein, is recovered and purity 
of the protein examined through gel electrophoresis followed by staining 
with Coomassie blue R250. 

Generation of a Peptide r om bin a tnri a l f iK.^y 

A random peptide combinatorial library may be generated 
according to the methods of Scott and Smith (1990; Science 249: 386-390). 
Briefly, the PCR is used to amplify a synthetic DNA fragment containing 
the degenerate sequence (NNK) 6 ; where 'N' represents an equal mixture 
of deoxynucleotides G, A, T, and C, and K represents an equal mixture of 
deoxynucleotides G and T. The degenerate sequence encodes for 
hexameric peptides among which are represented every possible 
combination of the 20 amino acids and amber stop codon. The PCR 
pror-ct is ligated into the gene ITI sequence of the filamentous 
bact, .ophage fUSE and the resulting phagemid introduced into E. coli 
30 through electroporation. 
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Identification and I solation nf Oleosin-Specifir T ig anHs 

The peptide phage display libraries are amplified, 
concentrated and stored in aliquots of 1012 tdu/ml. Purified mutant 
oleosin protein is biotinylated using a thiol-cleavable linker (S-S biotin, 
Pierce) and purified by size exclusion chromatography. Aliquots of the 
peptide phage display library containing 5x10" tdu in two ml are 
screened with the biotinylated protein at a concentration of 50 nM. Phage 
binding the mutant oleosin protein are recovered using 
streptavidin-coated paramagnetic beads. Following washing, the phage 
are eluted through the addition of 50mM dithiothreitol which cleaves 
the disulphide bond. The eluted phage are then incubated with an excess 
of log-phase F+ E. coli. Aliquots of the infected cells are plated to 
determine the phage titre and the remaining cells used in successive 
rounds of amplification and screening. Following enrichment of the 
eluted phage by 3-4 orders of magnitude, individual phage are selected" 
and tested for binding to mutant oleosin by direct ELISA. Binding by 
phage is detected using anti-phage antibodies (Crosby and Schorr, 1995, In 
Annual Review of Cell Biology). Single stranded DNA is isolated from 
phage exhibiting binding and the peptide-encoding sequence determined. • 
Affinity Piirifir a H on with O leosin-Sp p cific Lig anris 

The sequence for an oleosin ligand isolated as described 
above is fused in-frame upstream the sequence for gfplO (Prasher et al, 
1992, Gene 111: 229-233) encoding GFP and the construct ligated into the 
bacterial expression vector pKK233 (Pharmacia). Soluble protein is 
extracted through sonication of cells induced to express the ligand-GFP 
fusion, and adjusted to a concentration of 10 mg/ml in 50 mM Tris-HCl, 
pH 7.5. 

Twenty ml of the protein solution is mixed with 2ml of oil 
bodies prepared as described above, from seeds of non-transgenic plants. 
The mixture is incubated at 4°C for 30 min with agitation to allow 
binding and then centrifuged to separate the oil bodies and soluble 
fraction. The amount of GFP remaining in the soluble fraction after 
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removal of oil bodies is determined by fluorescence specrrofluorometry 
at a wavelength of 508 nm and compared with that in the original 
bacterial extract. The amount of bound GFP is calculated to determine 
the capacity of the matrix. 

The oil bodies are washed twice in 20 ml of 50 mM Tris-HCl, 
pH 7.5, resuspended in 2 ml of the same buffer and divided into 20 
aliquots of 100 ul. Conditions for the elution of ligand-GFP fusion 
protein are determined by adding 1ml of solutions ranging in pH from 
2-10 and in NaCl concentration from 0-1 M to different aliquots. After 
mixing and incubation at 4°C for 30 min, the oil bodies are removed and 
the soluble fractions collected. The amount of ligand-GFP fusion protein 
in the soluble fraction is determined by fluorescence spectrophotometry. 
EXAMPT F a. 

Removal of Heavy Metal Ions 

The following example demonstrates the utility of oil body 
affinity matrices for the recovery /removal of non-protein targets from 
complex solutions. For the purpose of this example the 
metallothionein/Cd- ligand pair was used. However other metal 
binding proteins such as phytochelatins (Rauser, 1990; Ann. Rev. 
Biochem; 59: 61-86) and metal ions including Cu~ and Zn- could also be 
used. 

Oleosin-Mptallot hionein Fminn 

An oleosin gene from a B. napus cDNA library (van 
Rooijen 1993, Ph.D. Thesis, University of Calgary) was amplified 
through PCR with oligonucleotide primers designed so as to create NotI 
and Ncol sites at the 5' and 3' ends of the gene respectively. The resulting 
fragment was digested and placed into the NotI /Ncol sites of pGN to 
yield plasmid poleGN. The human metallothionein gene, mt-II 
(Varshney and Gedamu, 1984, Gene, 31: 135-145) was amplified using 
oligonucleotide primers designed to create a unique NotI site at the 
3 -end of the gene. The resulting PCR product was subcloned into the 
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blunt-end EcoRV site of pBluescript KS+ to form pBSMTC. The mt-II 
gene was then excised from this plasmid and subcloned into the 
Ncol/Kpnl sites of poleGN replacing the GUS-NOS region to generate 
pOLEMTC. The 773 base oleosin-MT fusion of pOLEMTC was excised 
> with Not I digestion and inserted into the unique NotI site of polePN3" 
between the oleosin promoter (oleP; Van Rooijen et al, 1992, Plant Mol. 
Biol. 18: 1177-1179) and the P. crispum ubi4-2 gene terminator (ubi3'; 
Kawalleck et al., 1993, Plant Mol. Biol. 21: 673-684.) to generate pOOM3'. 
After the fusion was determined to be in the correct orientation, pOOM3' 
was digested with Kpnl to release the oleP-oleMT-ubi3' insert. This 
expression cassette was inserted at the Kpnl site of the binary vector 
PCGN1559 to yield the final construct pBIOOM3\ The sequence of the 
oleosin-metallothionein fusion is shown in Figure 8 and SEQ.ID.NO.6. 
The construction of plasmid pB100M3' is shown in Figure 9. 
15 Transfo rmation and Regeneration 

Transgenic B. carinata plants expressing the 
oleosin-metallothionein fusion were created using transformation and 
regeneration protocols as described in Example 1. 
Oil Body Preparation 

Washed oil bodies were prepared from B. carinata seeds of 
transgenic and control plants as described in Example 1. 
Removal of Cd-Hr From Solution Usin^ an Oi l Body Affinity 

The use of the oleosin-metallothionein fusion to bind 
cadmium ions in solution is shown schematically in Figure 10. 

A solution of 10 uM CdCl 2 in 10 mM Tris-HCl, pH 7.2 
containing 0.01 uCi/ml ™Cd was prepared. A 1 ml aliquot of this CdCl 2 
solution was thoroughly mixed with 100 ul of washed oil bodies (1.6 mg 
oil body protein) prepared from seeds expressing the 
oleosin-metallothionein fusion protein and incubated at 22°C for 1 hr. 
Following centrifugation for 5' at 10,000 xg to separate the oil bodies from 
the aqueous phase and 2 washes in 1 ml of 10 mM Tris-Cl, pH 7.2, the 
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amount of i09 Cd++ remaining bound to oil body fraction was determined 
using a gamma-counter (Cobra auto-gamma, Canberra Packard, Canada). 
An identical experiment was performed with oil bodies from 
non-transgenic seeds to detect and correct for non-specific binding of Cd 
ions to the matrix. 

Cd + ^ ions were eluted from the oil body metallothionein 
affinity matrix by mixing of the oil body fraction with 1 ml of 100 mM 
glycine (pH = 3.0) buffer (Pazirandeh et ah, 1995; Appl. Microbiol. 
Biotechn. 43: 1112-1117). Following centrifugation for 5 min. At 10,000 
xg, the oil body fraction was removed and assayed for bound Cd+ + ions as 
above. Figure 11 shows Cd binding and elution from the affinity matrix. 
EXAMPLE S 

Separation of Whole Cells 

The following example illustrates the capacity of oil bodies 
15 to immobilize whole cells. One potential for the use of bacterial cell 
separation lies in the utility for diagnostics. It is also desirable to separate 
unique eukaryotic cells such as lymphocytes and stem cells from complex 
mixtures of cells where the cell type of interest is present in relatively 
low numbers. 

Binding of Staphylococcus aureus tn nil bodies via protein A 

For the purpose of this example, S. aureus cells, which 
express protein A as a surface antigen were mixed with oil bodies with 
varying amounts of polyclonal anti-oleosin antibodies. 
Preparation n f 0 j\ bodies 

Seeds of B. napus cv Westar were surface sterilized in 
bleach, rinsed and ground with a mortar and pestle in grinding buffer (50 
mM Tris pH 7.5, 0.4 M sucrose and 100 mM glycine). The homogenate 
was filtered through Miracloth into sterile 15 ml Corex tubes. The 
filtered homogenate was then centrifuged for at 4°C for 10 min at 10,000 
xg. The oil body fraction was removed and resuspended in 50 mM Tris 
pH 7.5 and 0.4 M sucrose and washed two times using the same buffer. 
Aliquots of 1 ml oil bodies were transferred to 1.5 ml Eppendorf tubes 
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and centrifuged at room temperature for 10 min at 16,000 xg. The oil 
bodies were washed in 50 mM Tris pH 7.5 and 0.4 M sucrose 5-6 more 
times until no visible pellet was observed. 
Binding of S. aureus cel ls to anti-oleosin coated oil bodies 
> Formalin fixed S. aureus cells (Sigma, P-7155) were washed 

3-4 times in 50 mM Tris-Cl pH 7.5. and resuspended. Washed oil bodies 
(300 ul) and S. aureus cells (were mixed with varying amounts of anti- 
oleosin IgGs (50 ul). After mixing and incubating at room temperature for 
2 hrs, the mixtures were centrifuged at room temp at 16,000 xg for 5 min. 
The oil body fraction and unternatant were carefully removed and the 
cell pellet was washed twice in 1 ml 50 mM Tris-Cl pH 7.5. The walls of 
the tube were wiped with a tissue to remove traces of oil. Subsequently 
the drained cell pellets were resuspended in 1 ml of water and the OD 600 
were determined. Figure 12 is a representative experiment showing the 
decrease in the amount of cells present in the cell pellet as the 
concentration of anti-IgGs present in the oil-body S. aureus mixture 
increases. 

Differential Binding of Two Strai n s of Stap h ylococcus num.* 

In this experiment an oil body affinity matrix is employed to 
demonstrate differential binding of two strains of Staphylococcus aureus. 
Formalin fixed S. aureus strains, one expressing the IgG binding surface 
antigen protein A and one lacking protein A, are commercially available 
from Sigma. Dilute aliquots of both S. aureus strains of equal OD 550 
could be prepared. To each of these aliquots control oil bodies from 
untransformed plants or oil bodies mixed with anti-oleosin antibodies 
could be added. Following incubation for an appropriate length of time 
at an appropriate temperature, the samples could be centrifuged to pellet 
unbound bacterial cells and to separate the oil body fraction. The oil 
bodies could be decanted, vortexed and the OD 550 could be determined. 
The pellets could be resuspended and the OD 550 of the unternatant could 
be determined. It is anticipated that only in the sample containing the S. 
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aureus strain expressing protein A and the oil body complexed with 
anti-oleosin antibodies, fractionation of these cells to the oil body fraction 
will be observed. Binding of the cells to the oil body could be further 
demonstrated by lowering of the pH of the oil body fraction. Subsequent 
to centrifugation the release of cells from the oil bodies could be 
evidenced by the presence of a pellet and /or an increase in OD S50 upon 
resuspension of the pellet. 

Separation of Stap hylncnrr.us aureus from E. rnli 

A viable S. aureus strain could be mixed with varying 
quantities of cells of an E. coli strain having a specific antibiotic 
resistance. The mixed bacterial sample could be vortexed with control 
antibodies and with oil bodies which have been complexed with 
anti-oleosin antibodies. After incubation for an appropriate length of 
time and at an appropriate temperature oil bodies could be washed and 
the unternatant and oil bodies could be directly titrated and selectively 
plated on blood agar for S. aureus growth and on LB plates for £. coli 
growth. The enrichment or actual separation obtained could be 
determine by an estimate of colony forming units. 

Identification of Patho gens Pres e nt in Low Cnnr»nf rat ions in a Pnmp i.v 
Mixture 

For diagnostic purposes it is often desirable to concentrate 
bacterial or viral pathogens which invade human or animal tissues in 
low numbers. An oil body affinity matrix could be used to enrich for 
these pathogens, so that they could subsequently be identified and 
characterized. 

Pathogens often specifically bind to human or animal cells 
through the interaction with a receptor or surface proteir Oleosin could 
be fused to the human or animal protein ligand and recombinant oil 
bodies could be employed to immobilize the pathogens. Examples of the 
formation of protein complexes formed between proteins of human and 
pathogenic origins known to the prior art include: human fibrinogen or 
fibrin specific domains which bind to S. aureus protein clumping factor 
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A (clf-A) (McDevitt et al. 1995; Mol. Microbiol. 16; 895-907); human decay 
accelerating factor (DAF) to which urinary and intestinal tract pathogenic 
E. Coli bind (Nowicki et al. 1993: /. Of Experim. Med. 178: 2115-2121); a 
human cell ligand which is expressed in the carcinoma cell line Caco-2 
5 and which binds uniquely to the 28 kD Klebsiella pneumoniae fimbria 
protein KPF-28 (Di Maretino et al., 1996; Infect, and Immun. 64: 
2263-2266) and human cell extracellular matrix fibronectin specific 
domains which complex specifically with Streptococcus pyrogenes 
adhesin (protein F) (Ozeri et al, 1996; EMBO J. 15: 989-998). 
10 EXAMPLE 6 

Separation of Small Organic Molecules 

This example describes how an oil body affinity matrix may 
be used for the recovery/ removal of small organic molecules from 
solution. By way of example, the small organic molecule, biotin, is 
15 purified using avidin as a ligand. 
Construc tion of Avidin Ligands 

Avidin is a protein synthesized by avian species and exhibits 
an extremely high affinity for biotin, a natural co-factor for many 
carboxylases. Preparations of purified avidin (commercially available, 
from Sigma) can be conjugated chemically to anti-oleosin antibodies 
using standard procedures known to those skilled in the art. This 
approach would yield a bivalent avidin ligand suitable to demonstrate 
affinity based removal of biotin. Alternatively, an oleosin-avidin gene 
fusion may be utilized. The gene encoding avidin in chicken (Gallus 
? 5 gallus) has been identified and its sequence has been determined (Beattie 
et al., 1987, Nucl Acids Res. 15: 3595-3606). Based on the sequence the 
gene for avidin could be synthesized chemically or through the PCR and 
fused to the B. napus oleosin (van Rooijen, 1993, Ph.D. Thesis, 
University of Calgary) as described in example 4. Streptavidin, an 
0 analogous bacterial biotin binding protein, could also be employed. 
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Oil Body Prep aration 

Washed oil bodies would be prepared from seeds of 
transgenic plants and /or control plants as described in example 1. 
Binding of Biva lent AviHjn-Qleosin Liganrf 

Binding of anti-oleosin antibodies and removal of unbound 
ligand will be as detailed in example 3. 

Removal nf Binfrin, f T nm Snliifinr. 

Solutions containing known concentrations of biotin could 
be combined with a fixed amount of oil bodies complexed with an 
anti-oleosin antibodies conjugated with avidin. Following binding, the 
mixture would be centrifuged to separate oil body and aqueous fraction. 
The amount of biotin remaining in the aqueous fraction is determined 
by competitive ELISA using anti-biotin antibodies conjugated to horse 
radish peroxidase (HRP). The amount of bound biotin may be calculated 
assuming: 

[bound biotin] = [total biotin] - [free biotin] 
From the obtained values, the dissociation constants can be 
determined as described in example 2. As a control, an identical 
experiment could be performed with oil bodies bound to anti-oleosin 
antibodies which have not been conjugated with avidin. If desired, 
biotin could be released from the oil body-avidin matrix through 
competitive elution using an excess of 2- (4'-hydroxy benzene) benzoic acid 
(HABA). Elution may also be aided by employing a genetically 
engineered mutant of avidin which exhibits a lower affinity for biotin. 
Such mutants have been described for the analogous biotin binding 
protein from bacteria, streptavidin (Chilkoti et al., 1995; Bio/Technol. 13 
1198-1204). 
EXAMPT F 7 

Separation of Carbohydrates 

The following example describes the utility of oil body 
matrices for the recovery of carbohydrates from complex biological 
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mixtures. In this example the inventors demonstrate that an oil body 
immobilized cellulase is capable of binding cellulose. 
Oleosin-Cplliilnc q Binding Domain Fusion 

Several of the cellulases produced by the bacterium 
Cellulomonas fimi contain discrete cellulose binding domains (CBDs). 
These CBDs independently bind to cellulose even when they are 
separated by proteolytic cleavage or genetic manipulation from the 
catalytic domain of the enzyme. Plasmid pUC18-CBDPT contains a 
fragment coding for the CBD of the beta-l,4-glucanase (Gilkes et al., 1992, 
Journal of Biol. Chem. 267: 6743-6749) and could be used to construct an 
oleosin-CBD gene fusion. A DNA fragment encoding the CBD domain 
could be isolated from pUC18-CBDPT using appropriate restriction 
enzymes or using the PCR. Alternatively, the CBDs of other cellulases 
from C. Fimi or cellulases from other sources could be used. An oleosin' 
gene from B. Napus isolated from a cDNA library (van Rooijen, 1993, 
Ph.D. Thesis, University of Calgary) was cloned in pGN using the PCR 
and yielding plasmid pOLEGN as described in example 4. An in-frame 
gene fusion between the oleosin gene and the CBD gene could be f 
generated using standard molecular techniques known to those skilled in 
the art. The final construct would comprise the CBD domain 
translationally fused immediately downstream of the oleosin. 
Transformation and Regeneration 

In order to introduce the fusion gene construct in plants, it 
would be subcloned in a binary vector, such as pCGN1559. Transgenic 
plants which express the oleosin-CBD fusion could be generated as 
described in example 1. 
Oil Body Preparation 

Washed oil bodies could be prepared from the seeds of 
transgenic and control wild type plants as described in example 1. 
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Removal of Cellulose from Sol u tion Using an Oil Rnrfy Affinity 
Matrix 

In order to evaluate binding of cellulose to the oil body 
affinity matrix, the binding capacities of oil bodies of wild type and 
transgenic plants are compared. Oil bodies could be mixed with 
appropriately buffered solutions containing a range of cellulose 
concentrations. The oil body suspension could then be incubated for an 
appropriate length of time and at an appropriate temperature. Upon 
centrifugation, the unternatant could be recovered and assayed for 
cellulose concentrations. The concentrations bound cellulose and free 
cellulose could be calculated assuming: 

[bound cellulose] = [total cellulose] - [free cellulose] 
The ratio of the concentration bound over the concentration 
free cellulose could be plotted as a function of the concentration of bound 
cellulose. From these plots dissociation constants could be calculated 
following standard procedures (Scatchard, G. Ann. N. Y. Acad. Sci. 
(1949) 57: 660-672) and as detailed in example 2. 
EXAMPT F ft 

Separation of Nucleic Acids 

The following example describes a method in which oil 
bodies are employed to bind single stranded (SS) nucleic acids. 
Isolation of SinftlP St randed NWl e j c Arid* 

A method for capturing SS nucleic acids may be used in 
diagnostics, such as plant viral disease, or in research applications where 
non-reannealed SS nucleic acids need to be selectively removed from 
solutions such as in hybridization reactions for differential screening of 
expressed genes. Oleosins could be fused with SS DNA or RNA binding 
proteins or specific domains thereof and could be used to trap SS nucleic 
acids for identification or further amplification. Well characterized SS 
nucleic acid binding proteins include: Agrobacterial Ti plasmid Vir E2 
protein (Zupan et al., 1995, Plant Physiol. 107: 1041-1047); Tobacco 
Mosaic Virus (TMV) movement protein P30 (Citovsky et al., 1990; Cell 
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60: 637-647; Waigmann et al., 1994 Proc Natl. Acad. Sci (USA) 91: 
1433-1437); Cauliflower Mosaic Virus coat protein (Thompson et al, 
1993; /. Gen. Virol 74: 1141-1148) and E. Coli RecA and single stranded 
binding (SSB) proteins (Radding, 1991 /. Biol. Chem. 266: 5355-5358). 
EXAMPT F Q 

Separation nf Recombinant Proteins 

The following example further demonstrates the utility of 
an oil body affinity matrix for the purification of recombinant target 
proteins. For the purpose of this example, the IgG /protein A ligand pair 
has been chosen. The construct employed consists of a protein A domain 
which was fused to the 18 kDa Arabidopsis oleosin (Van Rooijen et al, 
1992; Plant Mol. Biol. 18: 1177-1179). Oil bodies containing oleosin- 
protein A fusion proteins were isolated and used to demonstrate specific 
binding of rabbit-anti-mouse IgGs conjugated to Horse Raddish? 
Peroxidase (HRP). The configuration of the oleosin-protein A fusion on 
the oil body and binding of IgG to the fusion is shown in Figure 15. . 
The Oleo sin-Protein A Fusion 

A synthetic protein A sequence encoding a protein capable 
of binding to IgG was synthesized based on reported sequence 
information (pRIT2T, protein A gene fusion vector; Pharmacia) and was 
amplified through the PCR. Each primer used in the PCR contained 
restriction sites 5' to the protein A-specific sequence in order to facilitate 
cloning. The reverse primer (i.e. the primer in the antisense direction) 
also contained a translational stop codon following the coding sequence. 
Fig 13 shows the position of the PCR primers relative to the protein A 
sequence. (The protein A sequence and the primer sequences are also 
separately shown in SEQ.ID.NO:8, SEQ.ID.NO:10 and SEQ.ID.NO:ll 
respectively). The resulting fragment was ligated into a pUC19 plasmid 
carrying the Arabidopsis oleosin gene comprised of an 867 bp upstream 
promoter region followed by the coding region (with its associated 
intron) from which the translational stop codon had been removed. The 
3' end of the construct contains the nopaline synthase transcriptional 
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terminator. A spacer sequence encoding a recognition sequence for the 
endoprotease thrombin was incorporated immediately downstream of 
the oleosin coding sequence. The protein A gene sequence was 
introduced between this spacer sequence and the terminator sequence. In 
the final expression construct the oleosin and protein A coding regions 
were fused in the same reading frame. The entire construct (Figure 14 
and SEQ.ID.NO-.12) was then excised from the pUC19 plasmid and 
subcloned into the plant transformation vector pCGN1559 (McBride and 
Summerfelt, 1990, Plant Mol. Biol. 14: 269-276) carrying a neomycin 
phosphotransferase gene under the control of the 35S CaMV promoter. 
The resulting plasmid was introduced in Agrobacterium (strain EHA101). 
Transf ormation and Regeneration 

Plants were transformed and regenerated as described in 
example 1. Transgenic plants were initially identified using a neomycin 
phosphotransferase assay and subsequently confirmed by expression of 
protein A fusions through immunbblot analysis. 
Preparation of Oil Bodies 

Oil bodies from the transgenic B. napus and B. carinata lines 
expressing the oleosin-protein A fusion were prepared following the 
procedure described in example 1. 
Binding of Oleosin- Protein A Fusions to f g r: 

Oil body protein extracts (20 ug/ aliquot) from various 
transgenic B. napus lines expressing oleosin-protein A fusion proteins 
were subjected to polyacrylamide gelelectrophoresis and subsequently 
transferred to a PVDF membrane following standard procedures. The 
membrane was then probed with a HRP-conjugated mouse anti-rabbit 
antibody and visualised following the procedure as outlined in 
Antibodies, a laboratory manual (Harlow and Lane, 198.8, Cold Spring 
Harbor). In Figure 16 the stained PVDF membrane is shown. A 50 kDa 
protein (predicted molecular mass of the oleosin-protein A fusion 
protein: 48,801 Da) was specifically detected in the protein extracts of all of 
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the six transgenic B. napus lines tested. Untransformed control plants did 
not exhibit HRP activity, while the a 30 kDa protein (predicted molecular 
mass 29,652 Da) was present in a bacterial lysate transformed with pRITZT 
encoding protein A and undetectable in the untransformed lysate. 
Binding and Elution of IeGs to Oil Bodies Con taining Ol- osin-ProtPin A 
Fusion Proteins 

Washed oil bodies (10 mg/ml protein) were prepared from 
• wildtype B. napus and a transgenic B. napus line transformed with a 
construct expressing an oleosin-protein A fusion protein as described in 
example 1 and suspended in 10 mM Tris-Cl pH 8.0. A volume of 2 ul (±34 
ug) of HRP-conjugated rabbit anti-mouse antibodies (Sigma, cat no 
A9044) was added to 500 ul of the washed oil body preparation and the 
suspension was incubated for 1 hr at room temperature or overnight at 
4°C. The samples were then centrifuged for 15 min at 16,000 xg and the 
undernatant was removed. Subsequently, the oil bodies were thoroughly 
resuspended in 500 ul 10 mM Tris-Cl pH 8.0 using a pestle. This washing 
step in Tris-Cl was repeated 4 times (henceforth termed washed oil body 
preparation). A 5 ul aliquot from the washed oil body preparation was 
washed a fifth time and then assayed for HRP activity. 

HRP assays were carried out by adding 1 ul of the washed oil 
body preparation to 1 ml of HRP assay mix (9.8 ml of 0.1 M NaOAc, 0.2 ml 
of 2.5 mg/ml Trimethylbenzidine in DMSO, 4 ul H 2 0 2 ) and incubating 
the mixture for 5 min at room temperature. The reaction was then 
stopped by adding 0.5 ml 1M H 2 S0 4 . The samples were filtered through a 
0.22 urn filter and subsequently the OD 450 's were determined 
spectrophotometrically. 

In order to elute the IgGs from the oil bodies, the washed oil 
body preparation was resuspended in 100 mM glycine pH 3.0 and 
centrifuged for 15 min at 16,000 xg and incubated for 30' at room 
temperature. Following neutralization in 500 ul 100 mM Tris-Cl pH 8.0, 
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both the oil body fraction and the eluate were assayed for HRP activity as 
above. The binding and elution of IgGs to oil bodies from weld type B. 
napus and transgenic B. napus expressing an oleosin protein A fusion, 
are illustrated in Figure 17. 

All publications, patents and patent applications are herein 
incorporated by reference in their entirety to the same extent as if each 
individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference in its entirety. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: 

(A) NAME: SemBioSys Genetics Inc 

<B) STREET: 609-14 Street, N.W. 

(C) CITY: Calgary 

(D) STATE: Alberta 

(E) COUNTRY: Canada 

(F) POSTAL CODE: T2N 2A1 

(G) TELEPHONE NO.: (403) 220-5161 

(H) TELEFAX NO.: (403) 220-0704 

(A) NAME: Moloney, Maurice 

(B) STREET: 34 Edgebrook Cove N W 
<C) CITY: Calgary 

(D) STATE: Alberta 

(E) COUNTRY: Canada 

(F) POSTAL CODE: T3A 5N5 

(A) NAME: Boo the , Joseph 

(B) STREET: #302, 332 6th Avenue N W 

(C) CITY: Calgary 

(D) STATE: Alberta 

( E ) COUNTRY : Canada 

(F) POSTAL CODE: T2E 0L9 

(A) NAME: Rooijen, Gijs Van 

(B) STREET: 32 2 3 Bearspaw Drive N.W 

(C) CITY: Calgary 

(D) STATE: Alberta 

( E ) COUNTRY : Canada 

(F) POSTAL CODE: T2L 1T1 

(ii) TITLE OF INVENTION: Oil Bodies and Associated Proteins as 
Affinity Matrices 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BERESKIN & PARR 

(B) STREET: 40 King Street West 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP : M5H 3Y2 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

( C ) CLASSIFICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/767,026 

(B) FILING DATE: 1S-DEC-1997 



(viii) ATTORNEY /AGENT INFORMATION: 
(A) NAME: Gravelle, Micheli 
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(B) REGISTRATION NUMBER: 40,261 

(C) REFERENCE / DOCKET NUMBER: 9369-050 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (416) 364-7311 

(B) TELEFAX: (416) 361-1398 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oleosin From Arabidopsis Thali 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 522 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATG GCG GAT ACA GCT AGA GGA ACC CAT CAC GAT ATC ATP rrr ana r*r* 
Met Ala Asp Thr Ala Arg Gly Thr His SiS ill J£ ™ 

5 10 15 

CAG TAG CCG ATG ATG GGC CGA GAC CGA GAC CAG TAC CAG ATG TCC CCA 
Gin Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr G^n Set sir Gly 
20 25 30 

Ira Civ 1*1 a AC I™ c CC ^ TCT AGG CAG ATT GCT AAA GCT GCA ACT 144 
Arg Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr 

Jb 40 45 

S ill ?S S SS SJ m SS S 2S IS J5 £ U T S£ £S 

51 SJ i£ SS ili S 2S iE SI Si JS S S 2S SS in 2 " 

70 75 80 

£ £ £5 ilS S S5 S 2S S JS SI S 2S 2S SIS 288 
S £ S S S S |g s « {« « J,. ™ ; „ 336 

52 S« IS S - « I- S *" - - - CCA « |? , JCJ 384 

12 0 125 

2S £ 2S S5 iS S £f SS £ 2S i£ ^ SI as S« ' «> 

& ss s s ss SS SiS Si a: }g ss 35 Si «• 
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145 150 155 160 

CAT GAC CGT GAC CGT ACT CGT GGT GGC CAG CAC ACT ACT TAA 500 

His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr * 

165 170 



(2) INFORMATION FOR SEQ ID NO ; 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp 
1 5 10 15 

Gin Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly 
20 25 30 

Arg Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr 
35 40 45 

Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu 
50 55 60 

Val Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He 
65 70 75 80 

Phe Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He 
85 90 95 

Thr Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val 
100 105 HO 

Phe Ser Trp He Tyr Lys Tyr Ala Thr Gly Glu His Pro Gin Gly Ser 
115 120 125 

Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Ser Lys Ala Gin Asp 
130 135 140 

Leu Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu 
14b I 50 155 160 

His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr * 
165 170 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2115 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oleosin - Hirudin ■ Fusion 

( ix ) FEATURE : 

(A) NAME /KEY : CDS 
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(B) LOCATION: 862.. 1215 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1456.. 1833 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTATACCCAA CCTCGGTCTT GGTCACACCA GGAACTCTCT GGTAAGCTAG CTCCACTCCC 

CAGAAACAAC CGGCGCCAAA TTGCCGGAAT TGCTGACCTG AAG AC GGAAC ATCATCGTCG 

GGTCCTTGGG CGATTGCGGC GGAAGATGGG TCAGCTTGGG CTTGAGGACG AGACCCGAAT 

CGAGTCTGTT GAAAGGTTGT TCATTGGGAT TTGTATACGG AGAT7GGTCG TCGAGAGGTT 

TGAGGGAAAG GACAAATGGG TTTGGCTCTG GAGAAAGAGA GTGCGGCTTT AGAGAGAGAA 

TTGAGAGGTT TAGAGAGAGA TGCGGCGGCG ATGACGGGAG GAGAGACGAC GAGGACCTGC 

ATTATCAAAG CAGTGACGTG GTGAAATTTG GAACTTTTAA GAGGCAGATA GATTTATTAT 

TTGTATCCAT TTTCTTCATT GTTCTAGAAT GTCGCGGAAC AAATTTTAAA ACTAAATCCT 

AAATTTTTCT AATTTTGTTG CCAATAGTGG ATATGTGGGC CGTATAGAAG GAATCTATTG 

AAGGCCCAAA CCCATACTGA CGAGCCCAAA GGTTCGTTTT GCGTTTTATG TTTCGGTTCG 

ATGCCAACGC CACATTCTGA GCTAGGCAAA AAACAAACGT GTCTTTGAAT AGACTCCTCT 

CGTTAACACA TGCAGCGGCT GCATGGTGAC GCCATTAACA CGTGGCCTAC AATTGCATGA 

TGTCTCCATT GACACGTGAC TTCTCGTCTC CTTTCTTAAT ATATCTAACA AACACTCCTA 

CCTCTTCCAA AATATATACA CATCTTTTTG ATCAATCTCT CATTCAAAAT CTCATTCTCT 

CTAGTAAACA AGAACAAAAA A ATG GCG GAT ACA GOT AGA GGA ACC CAT CAC 

Met Ala Asp Thr Ala Arg Gly Thr His His 
15 10 

HI tT° AGA gac cag tac CCG ATG ^ GGC CGA GAC CGA GAC 

Asp lie He Gly Arg Asp Gin Tyr Pro Met Met Gly Arg SJp Sg Sp 

15 20 25 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
891 

939 



1035 



S2 £ 3S IS? 155 SJ S §S III 25 S5 K5 £5 155 iSS 25 

35 40 

rll a?I ^ T ° CA ACT GCT GTC ACA GCT GGT GGT TCC CTC CTT GTT 

-le Ala Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Sal 

3 50 55 

LeS Ser ^ f TT ACC CTT GTT GGA ACT GTC ATA GCT TTG ACT G^T GCA 10 8"! 

Leu Ser Ser Leu Thr Leu Val Gly Thr Val He Ala Leu Thr Val aS " 

Thi Pro r° ? TC GTT ATC TTC AGC CCA ATC CTT GTC CCG GCT CTC ATC 
Thr Pro Leu , eu Val lie Phe Ser Pro He Leu Val Pro 111 w He 

80 85 90 

Th£ vIT G ? A CTC CTC ATC ACC GGT TTT CTT TCC TCT GGA GGG TTT GGC 
Thr val Ala Leu Leu He Thr Gly Phe Leu Ser Ser Gly G!y J£ g£ 



1131 



95 100 105 

ATT GCC GCT ATA ACC GTT TTC TCT TGG ATT TAC AAG TAAGCACACA 



1179 



1225 
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He Ala Ala He Thr Val Phe Ser Trp lie Tyr Lys 
HO 115 

TTTATCATCT TACTTCATAA TTTTGTGCAA TATGTGCATG CATGTGTTGA GCCAGTAGCT 12 8 5 

TTGGATCAAT TTTTTTGGTC GAATAACAAA TGTAACAATA AGAAATTGCA AATTCTAGGG 134 5 

AACATTTGGT TAACTAAATA CGAAATTTGA CCTAGCTAGC TTGAATGTGT CTGTGTATAT 

CATCTATATA GGTAAAATGC TTGGTATGAT ACCTATTGAT TGTGAATAGG TAC GCA 



Tyr Ala 
1 



JES r?f £ CA CAG GGA TCA GAC ^ TTG GAC AGT GCA AGG ATG 

Thr Gly Glu His Pro Gin Gly Ser Asp Lys Leu Asp Ser Ala Arg Met 

5 10 15 

AAG TTG GGA AGC AAA GCT CAG GAT CTG AAA GAC AGA GCT CAG TAC TAC 
Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp Arg Ala Gin Tyr Tv 
20 25 30 



r? £ A 5.^ CAT ACT GGT TGG GAA CAT GAC CGT GAC CGT ACT CGT GGT 
Gly Gin Gin His Thr Gly Trp Glu His Asp Arg Asp Arg Thr Arg Gly 
Ji 40 45 50 



1405 
1461- 



1509 



1557 



1605 



1653 



1701 



1749 



1797 



C C ACT ACT GCG ATC GAA GGG AGA AT C ACT TAC ACT GAC TGT 
Gly Gin riis Thr Thr Ala He Glu Gly Arg He Thr Tyr Thr Asp Cys 
55 60 65 

A S; GAA £ CT GGA CAG CTC TGT CTC TGT GAA GGA TCT AAC GTT TGT 

Thr Glu Ser Gly Gin Asn Leu Cys Leu Cys Glu Gly Ser Asn Val Cys 
70 75 80 

GGA AAG GGA AAC AAG TGT ATC CTC GGA TCT AAC GGA AAG GGA AAC CAG 
Gly Lys Gly Asn Lys Cys He Leu Gly Ser Asn Gly Lys Gly Asn Gin 
85 90 95 

SI Sal c w r? A £ T ^ C CCA GAA TCT CAC ^ C GGA 

cys Val .hr Gly Glu Gly Thr Pro Asn Pro Glu Ser His Asn Asn Gly 

100 105 HO 

ll C ^ GAA ATC CCT GAA GAA TAC CTC CAG TAA GTCGACTCTA 184^ 
Asp Phe Glu Glu He Pro Glu Glu Tyr Leu Gin * 1843 
115 120 125 

GACGGATCTC CCGATCGTTC AAACATTTGG CAATAAAGTT TCTTAAGATT GAATCCTGTT 
GCCGGTCTTG CGATGATTAT CATATAATTT CTGTTGAATT ACGTTAAGCA TGTAATAATT 
AACATGTAAT GCATGACGTT ATTTATGAGA TGGGTTTTTA TGATTAGAGT CCCGCAATTA 
TACATTTAAT ACGCGATAGA AAACAAAATA TAGCGCGCAA ACTAGGATAA ATTATCGCGC 
GCGGTGTCAT CTATGTTACT AGATCGGAAT TC 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



1903 
1963 
2023 
2083 
2115 
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Met Ala Asp Thr 

1 



Gin Tyr Pro Met 
20 

Arg Gly Ser Asp 
35 

Ala Val Thr Ala 
50 

Val Gly Thr Val 
65 



Phe Ser Pro lie 



Thr Gly Phe Leu 
100 

Phe Ser Trp lie 
115 



Ala Arg Gly Thr 
5 



Met Gly Arg Asp 



Tyr Ser Lys Ser 
40 

Gly Gly Ser Leu 
55 

He Ala Leu Thr 
70 



Leu Val Pro Ala 
85 



Ser Ser Gly Gly 



Tyr Lys 



His His Asp He 
10 

Arg Asp Gin Tyr 
25 



Arg Gin He Ala 



Leu Val Leu Ser 
60 

Val Ala Thr Pro 
75 

Leu He Thr Val 
90 

Phe Gly He Ala 
105 



He Gly Arg Asp 
15 

Gin Met Ser Gly 
30 

Lys Ala Ala Thr 
45 

Ser Leu Thr Leu 



Leu Leu Val He 
80 

Ala Leu Leu He 
95 

Ala He Thr Val 
110 



(2). INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Tyr Ala Thr Gly Glu His Pro. Gin Gly Ser Asp Lys Leu Asp Ser Ala 

10 15 

Arg Met Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp Arg Ala Gin 
ZU 25 30 

Tyr Tyr Gly Gin Gin His Thr Gly Trp Glu His Asp Arg Asp Arg Thr 

40 45 

Arg Gly Gly Gin His Thr Thr Ala. lie Glu Gly Arg He Thr Tyr Thr 



Asp Cys Thr Glu Ser Gly Gin Asn Leu Cys Leu Cys Glu Gly Ser Asn 

75 80 

Val Cys Gly Lys Gly Asn Lys Cys He Leu Gly Ser Asn Gly Lys Gly 

85 90 95 

Asn Gin Cys Val Thr Gly Glu Gly Thr Pro Asn Pro Glu Ser His Asn 

105 110 

Asn Gly Asp Phe Glu Glu He Pro Glu Glu Tyr Leu Gin * 
115 120 125 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 23 6 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oleosin - Metallothionein Fusion 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(3) LOCATION: 1092.. 1856 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

GAGCTCAAAT ACGATCTGAT ACTGATAACG TCTAGATTTT TAGGGTTAAA GCAATCAATC 

ACCTGACGAT TCAAGGTGGT TGGATCATGA CGATTCCAGA AAACATCAAG CAAGCTCTCA 

AAGC T AC ACT CTTTGGGATC ATACTGAACT CTAACAACCT CGTTATGTCC CGTAGTGCCA 

GTACAGACAT CCTCGTAACT CGGATTATGC ACGATGCCAT GGCTATACCC AACCTCGGTC 

TTGGTCACAC CAGGAACTCT CTGGTAAGCT AGCTCCACTC CCCAGAAACA ACCGGCGCCA 

AATTGCCGGA ATTGCTGACC TGAAGACGGA ACATCATCGT CGGGTCCTTG GGCGATTGCG 

GCGGAAGATG GGTCAGCTTG GGCTTGAGGA CGAGACCCGA ATCGAGTCTG TTGAAAGGTT 

GTTCATTGGG ATTTGTATAC GGAGATTGGT CGTCGAGAGG TTTGAGGGAA AGG AC AAATG 

GGTTTGGCTC TGGAGAAAGA GAGTGCGGCT T T AG AGAGAG AATTGAGAGG TTTAGAGAGA 

GATGCGGCGG CGATGACGGG AGGAGAGACG ACGAGGACCT GCATTATCAA AGCAGTGACG 

TGGTGAAATT TGGAACTTTT AAGAGGCAGA TAGATTTATT ATTTGTATCC ATTTTCTTCA 

TTGTTCTAGA ATGTCGCGGA ACAAATTTTA AAACTAAATC CTAAATTTTT CTAATTTTGT 

TGCCAATAGT GGATATGTGG GCCGTATAGA AGGAATCTAT TGAAGGCCCA AACCCATACT 

GACGAGCCCA AAGGTTCGTT TTGCGTTTTA TGTTTCGGTT CGATGCCAAC GCCACATTCT 

GAGCTAGGCA AAAAACAAAC GTGTCTTTGA ATAGACTCCT CTCGTTAACA CATGCAGCGG 

CTGCATGGTG ACGCCATTAA CACGTGGCCT ACAATTGCAT GATGTCTCCA TTGACACGTG 

ACTTCTCGTC TCCTTTCTTA ATATATCTAA CAAACACTCC TACCTCTTCC AAAATATATA 

CACATCTTTT TGATCAATCT CTCATTCAAA ATCTCATTCT CTCTAGTAAA CAGGATCCCC 

CTCGCGGCCG C ATG GCG GAT ACA GCT AGA ACC CAT CAC GAT GTC ACA AGT 
Met Ala Asp Thr Ala Arg Thr His His Asp Val Thr ter 



CGA GAT CAG TAT CCC CGA GAC CGA GAC CAG TAT TCT ATG ATC CCT rr* 
Arg Asp Gin Tyr Pro Arg Asp Arg Asp Gin Tyr sir Met ill J£ £J 
■ L3 20 25 

S l^l ^ AC ^AG TAC TCT ATG ATG GGC CGA GAC CGA GAC CAG TAC AAC 
Asp Arg Asp Gin Tyr Ser Met Met Gly Arg Asp Arg Asp Gin Tyr 

35 40 



Se? Tv3 r?7 P A ? AC TAC TCC ^ G TC T AGA CAG ATT GCT AAG GCT GTT 
Met Tyr Gly Arg Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Val 

50 55 60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1130 

1178 

1226 

1274 
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ACC GCA GTC ACG GCG GGT GGG TCC CTC CTT GTC CTC TCC AGT CTC ACC mo 
Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu £hr 
65 70 75 

CTT GTT GGT ACT GTC ATT GCT TTG ACT GTT GCC ACT CCA CTC CTC GTT mn 
Leu Val Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Su Leu Sal 13? ° 
80 85 90 

ATC TTT AGC CCA ATC CTC GTG CCG GCT CTC ATC ACC GTA GCA CTT CTC 141 r 

He Phe Ser Pro He Leu Val Pro Ala Leu lie Thr VaY SEla ill III ^ 

y=> ioo — 



105 



ATC ACT GGC TTT CTC TCC TCT GGT GGG TTT GCC ATT GCA GCT ATA ACC 
lie Thr Gly Phe Leu Ser Ser Gly Gly Phe Ala He All til lli ?hr 

115 120 125 

GTC TTC TCC TGG ATC TAT AAG TAC GCA ACG GGA GAG CAC CCA CAG GGG 
Val Phe Ser Trp lie Tyr Lys Tyr Ala Thr Gly Glu His Pro gi? afy 



140 



TCA GAT AAG TTG GAC AGT GCA AGG ATG AAG CTG GGA ACC AAA GCT CAG 
Ser Asp Lys Leu Asp Ser Ala Arg Met Lys Leu Gly Thr JJJ Sa cin 

150 155 

GAT ATT AAA GAC AGA GCT CAA TAC TAC GGA CAG CAA CAT ACA GGT CCT 
Asp n e Lys Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His J£ Gil Gil 



170 



1466 



1514 



1562 



1610 



Gin StI * GT GAC CGT ACT CGT GGT GGC CAG CAC ACT ACT CTC GTT 165ft 

Glu His Asp Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Leu Sal 
3 180 185 

Pro ira St £2 ^ 0°° ^ TG ° TCC TGT GCC GCC AGT ^C TCC 

Fro Arg Gly Ser Met Asp Pro Asn Cys Ser Cys Ala Ala Ser Asp Ser 

195 200 205 

Ss t"S £S a? 0 ^ 1 CC TGC AAG TGC GAG TGC ™A T GC ACC TCC 

Cys Thr Cys Ala Gly Ser Cys Lys Cys Lys Glu Cys Lys Cys Thr Ser 

210 215 220 



1706 



1754 



c T S £ & JS £ S £ lyl S £ ^ S £ cyl 

230 235 

ss a; is s ?n ss si ss s s as & ss is si "« 

245 250 
GCC TGA GCGGCCGCGA GGGC TGC AGA ATGAGTTCCA AGATGGTTTG TGACGAAGTT . 1 90 6 

255 

AGTTGGTTGT TTTTATGGAA CTTTGTTTAA GCTTGTAATG TGGAAAGAAC GTGTGGCTTT 
GTGGTTTTTA AATGTTGGTG AATAAAGATG TTTCCTTTGG ATTAACTAGT ATTTTTCCTA 
TTGGTTTCAT GGTTTTAGCA CACAACATTT TAAATATGCT GTTAGATGAT ATGCTGCCTG 
CTTTATTATT TACTTACCCC TCACCTTCAG TTTCAAAGTT GTTGCAATGA CTCTGTGTAG 
TTTAAGATCG AGTGAAAGTA GATTTTGTCT ATATTTATTA GGGGTATTTG ATATGCTAAT 
GGTAAACATG GTTTATGACA GCGTACTTTT TTGGTTATGG TGTTGACGTT TCC TTTTAAA 
CATTATAGTA GCGTCCTTGG TCTGTGTTCA TTGGTTGAAC AAAGGCACAC TCACTTGGAG 



1966 
2026 
2086 
2146 
2206 
2266 
2326 
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ATGCCGTCTC CACTGATATT TGAACAAAGA ATTCGGTACC 23 6 1 

(2) INFORMATION FOR SEQ ID NO : 7 : 

C i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 55 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Ala Asp Thr Ala Arg Thr His His Asp Val Thr Ser Arg Asp Gin 
1 5 10 15 

Tyr Pro Arg Asp Arg Asp Gin Tyr Ser Met He Gly Arg Asp Arg Asp ' 
20 25 30 

Gin Tyr Ser Met Met Gly Arg Asp Arg Asp Gin Tyr Asn Met Tyr Gly 
35 40 45 

Arg Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Val Thr Ala Val 
5 0 55 60 

Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu Val Gly 
65 70 75 80 

Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He Phe Ser 
85 90 95 

Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He Thr Gly 
100 105 110 

Phe Leu Ser Ser Gly Gly Phe Ala He Ala Ala He Thr Val Phe Ser 
115 120 125 

Trp lie Tyr Lys Tyr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lys 
130 135 140 

Leu Asp Ser Ala Arg Met Lys Leu Gly Thr Lys Ala Gin Asp He Lys 
14:3 150 155 160 

Asp Arg Ala Gin Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu His Asp 
165 170 175 

Arg Asp Arg Thr Arg Gly Gly Gin His Thr Thr Leu Val Pro Arg Gly 
180 185 190 

Ser Met Asp Pro Asn Cys Ser Cys Ala Ala Ser Asp Ser Cys Thr Cys 
195 200 205 

Ala Gly Ser Cys Lys Cys Lys Glu Cys Lys Cys Thr Ser Cys Lys Lys 
210 215 220 

Ser Cys Cys Ser Cys Cys Pro Val Gly Cys Ala Lys Cys Ala Gin Gly ' 
225 230 235 240 

Cys He Cys Lys Gly Ala Ser Asp Lys Cys Ser Cys Cys Ala * 
245 250 255 

(2)- INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Protein A Primers 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 5.. 796 



49 



97 



145 



193 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

CTCC ATG GAT CAA CGC AAT GGT TTT ATC CAA AGC CTT AAA GAT GAT CCA 
Met Asp Gin Arg Asn Gly Phe He Gin Ser Leu Lys Asp Asp Pro 
15 10 15 

AGC CAA AGT GCT AAC GTT TTA GGT GAA GCT CAA AAA CTT AAT GAC TCT 
Ser Gin Ser Ala Asn Val Leu Gly Glu Ala Gin Lys Leu j£n 2£ Ter 
20 25 30 

o CA GCT GAT GCG CAA CAA ^ T ^ c TTC AAC AAA GAT CAA 

Gin Ala Pro Lys Ala Asp Ala Gin Gin Asn Asn Phe Asn Lys Asp Gin* 
3b 40 45 

cJn" Ser 111 Ph° l AT TTG ^ ATG CCT ** C TTA AAC GAA GCG 

Gin Ser Ala Phe Tyr Glu He Leu Asn Met Pro Asn Leu Asn Glu Ala 

3U 55 60 

rt^ ^ C GGC TTC ATT CAA AGT CTT AAA G AC GAC CCA AGC CAA AGC 

Gin Arg Asn Gly Phe He Gin Ser Leu Lys Asp Asp Pro" sIt'g™ Ser 

70 75 

5hr itS vll 11? r? T ^ ?? T *** *** TTA AAC GAA TCT CAA GCA CCG 
Th. Asn Val Leu Gly Glu Ala Lys Lys Leu Asn Glu Ser Gin Ala Pro 

85 90 95 

lCJ S?I AAC AAT TTC AAC *** GAA CAA CAA AAT GCT TTC TAT GAA 

Lys Ala Asp Asn Asn Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr gJS 

100 105 110 

ATC TTG AAT ATG CCT AAC TTA AAC GAA GAA CAA CGC AAT GGT TTC ATC 
He Leu Asn Met Pro Asn Leu Asn Glu Glu Gin Arg kin g?J fie He 
115 120 125 

CAA AGC TTA AAA GAT GAC CCA AGC CAA AGT GCT AAC CTA TTG ni 
Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala JJn Leu* Uu s£ 
"* U I 35 140 

AlJ L^t ? TA AAT GAA TCT CAA GCA CCG AAA GCG GAT AAC AAA TTC 

Ala Lys Lys Leu Asn Glu Ser Gin Ala Pro Lys Ala Asp Asn Jyt III 

150 155 

AAC AAA GAA CAA CAA AAT GCT TTC TAT GAA ATC TTA CAT TTA CCT AAC 
Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu He Leu Hxl Let Pro 

165 170 ' 175 

7I A AAC 5^ GAA CAA CGC AAT GGT TTC ATC CAA AGC CTA AAA GAT gap 
LSU ASn G1U G1 " Arg Asn Gly Phe He Gin s2r Uu £yt S£ Sp 

180 185 19 0 

Pro Ser G ? T ^ C CTT ? TA GCA GAA GCT AAG CTA AAT GAT 

ber Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp 



289 



337 



385 



481 



529 



577 



625 
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195 200 205 

GCT CAA GCA CCA AAA GCT GAC AAC AAA TTC AAC AAA GAA CAA CAA AAT 67 3 

Ala Gin Ala Pro Lys Ala Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn 
210 215 220 

GCT TTC TAT GAA ATT TTA CAT TTA CCT AAC TTA ACT GAA GAA CAA CGT 72^ 
Ala Pne Tyr Glu lie Leu His Leu Pro Asn Leu Thr Glu Glu Gin Arq 
225 230 235 

AAC GGC TTC ATC CAA AGC CTT AAA GAC GAT CCG GGG AAT TCC CGG GGA 769 
Asn Gly.Phe He Gin Ser Leu Lys Asp Asp Pro Gly Asn Ser Arg Sly 
240 245 250 255 

TCC GTC GAC CTG CAG ATA ACA AAT TAG AAGCTTGC oha 
Ser Val Asp Leu Gin He Thr Asn * 
260 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Asp Gin Arg Asn Gly Phe He Gin Ser Leu Lys Asp Asp Pro c er 
15 10 15 

Gin Ser Ala Asn Val Leu Gly Glu Ala Gin Lys Leu Asn Asp Ser Gin 
20 25 30 

Ala Pro Lys Ala Asp Ala Gin Gin Asn Asn Phe Asn Lys Asp Gin Gin 

40 45 

Ser Ala Phe Tyr Glu lie Leu Asn Met Pro Asn Leu Asn Glu Ala Gin 
DU 55 60 

Arg Asn Gly Phe He Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Thr 



70 75 



80 



Asn Val Leu Gly Glu Ala Lys Lys Leu Asn Glu Ser Gin Ala Pro Lys 
85 90 95 

Ala Asp Asn Asn Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu He 
100 105 1 10 

Leu Asn Met Pro Asn Leu Asn Glu Glu Gin Arg Asn Gly Phe He Gin 
115 120 



125 



Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ser Glu Ala 

JU 135 140 

Lys Lys Leu Asn Glu Ser Gin Ala Pro Lys Ala Asp Asn Lys Phe Asn 

150 155 160 

Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu 

165 170 175 

Asn Glu Glu Gin Arg Asn Gly Phe He Gin Ser Leu Lys Asp Asp Pro 

180 185 ^ 



190 
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Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala 
iyb 200 205 

Gin Ala Pro Lys Ala Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 
210 215 220 

Phe Tyr Glu He Leu His Leu Pro Asn Leu Thr Glu Glu Gin Arg Asn 

230 235 240 

Gly Phe He Gin Ser Leu Lys Asp Asp Pro Gly Asn Ser Arg Gly Ser 
245 250 255 

Val Asp Leu Gin He Thr Asn * 
260 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Primer Bk 266 

(xi) SEQUENCE DESCRIPTION: SEQ ID" NO: 10: 
CTCCATGGAT CAACGCAATG GTTTATC 

27 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Primer Bk267 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCAAGCTTCT AATTTGTTAT CTGCAGGTC 

29 

(2) INFORMATION FOR SEQ ID NO: 12: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2709 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 
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(B) LOCATION: 868.. 1220 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1462.. 2436 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CCATGGCTAT ACCCAACCTC GGTCTTGGTC ACACCAGGAA CTCTCTGGTA AGCTAGCTCC 6 0 

ACTCCCCAGA AACAACCGGC GCCAAATTGC CGGAATTGCT GACCTGAAGA CGGAACATCA 120 

TCGTCGGGTC CTTGGGCGAT TGCGGCGGAA GATGGGTCAG CTTGGGCTTG AGG AC GAG AC 180 

CCGAATCGAG TCTGTTGAAA GGTTGTTCAT TGGGATTTGT ATACGGAGAT TGGTCGTCGA 240 

GAGGTTTGAG GGAAAGGACA AATGGGTTTG GCTCTGGAGA AAGAGAGTGC GGCTTTAGAG 3 00 

AGAGAATTGA GAGGTTTAGA GAGAGATGCG GCGGCGATGA CGGGAGGAGA GACGACGAGG 360 

ACCTGCATTA TCAAAGCAGT GACGTGGTGA AATTTGGAAC TTTTAAGAGG CAGATAGATT 420 

TATTATTTGT ATCCATTTTC TTCATTGTTC TAGAATGTCG CGGAACAAAT TTTAAAACTA 480 

AATCCTAAAT TTTTCTAATT TTGTTGCCAA TAGTGGATAT GTGGGCCGTA TAGAAGGAAT 540 

CTATTGAAGG CCCAAACCCA TACTGACGAG CCCAAAGGTT CGTTTTGCGT TTTATGTTTC 600 

GGTTCGATGC CAACGCCACA TTCTGAGCTA GGCAAAAAAC AAACGTGTCT TTGAATAGAC 660 

TCCTCTCGTT AACACATGCA GCGGCTGCAT GGTGACGCCA TTAACACGTG GCCTACAATT 72 0 

GCATGATGTC TCCATTGACA CGTGACTTCT CGTCTCCTTT CTTAATATAT CTAACAAACA 780 

CTCCTACCTC TTCCAAAATA TATACACATC TTTTTGATCA ATCTCTCATT CAAAATCTCA 84 0 

TTCTCTCTAG TAAACAAGAA CAAAAAA ATG GCG GAT ACA GCT AGA GGA ACC 891 

Met Ala Asp Thr Ala Arg Gly Thr 
1 5 

StI S AC ^ AT A T° ATC GGC AGA GAC CAG TAC CCG ATG ATG GGC CGA GAC 
his His Asp lie He Gly Arg Asp Gin Tyr Pro Met Met Gly Arg Asp 
1U 15 20 

CGA GAC CAG TAC CAG ATG TCC GGA CGA GGA TCT GAC TAC TCC AAG TCT 
Arg Asp Gin Tyr Gin Met Ser Gly Arg Gly Ser Asp Tyr Ser Lys Ser 
5 30 35 40 

AGG GAG A ? T GCT AAA GCT GCA ACT GCT GTC ACA GCT GGT GGT TCC CTC 
Arg Gin He Ala Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu 
45 50 55 

CTt GTT CTC TCC AGC CTT ACC CTT GTT GGA ACT GTC ATA GCT TTG ACT 
Leu Val Leu Ser Ser Leu Thr Leu Val Gly Thr Val He Ala Leu Thr 
60 65 70 

wT? ° CA ACA CCT CTG CTC GTT ATC TTC AGC CCA ATC CTT GTC CCG GCT 
Val Ala Thr Pro Leu Leu Val He Phe Ser Pro He Leu Val Pro Ala 
75 80 85 

CTC ATC ACA GTT GCA CTC CTC ATC ACC GGT TTT CTT TCC TCT GGA GGG 
Leu lie Thr Val Ala Leu Leu He Thr Gly Phe Leu Ser Ser Gly Glv 
90 95 100 

TTT GGC ATT GCC GCT ATA ACC GTT TTC TCT TGG ATT TAC AA 122 0 



939 



987 



1035 



1083 



1131 



1179 
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Phe Gly He Ala Ala He Thr Val Phe Ser Trp He Tyr 
105 110 H5 

GTAAGCACAC ATTTATCATC TTACTTCATA ATTTTGTGCA ATATGTGCAT GCATGTGTTG 1280 

AGCCAGTAGC TTTGGATCAA TTTTTTTGGT CGAATAACAA ATGTAACAAT AAGAAATTGC 

AAATTCTAGG GAACATTTGG TTAACTAAAT AC GAAATTTG ACCTAGCTAG CTTGAATGTG 

TCTGTGTATA TCATCTATAT AGGTAAAATG CTTGGTATGA TACCTATTGA TTGTGAATAG 

G TAC GCA ACG GGA GAG CAC CCA CAG GGA TCA GAC AAG TTG GAC AGT 
Tyr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lys Leu Sp iTr 

5 10 15 

"* £ - ™ 22 s S£ J2 as a; s St 2S 5S 21 

as & f as as s: s & s si - - - ™ c S „„, 



45 



75 



GCT AAC GTT TTA GGT GAA GCT CAA AAA CTT AAT GAC TCT CAA GCT CCA 
Ala Asn Val Leu Gly Glu Ala Gin Lys Leu Asn Asp Ser Gin 111 Pro 

B:> 90 95 



120 125 



170 175 



1340 
1400 
1460 
1506 

1554 



Th- a G ? C CAG CAC ACT ACT CTC GTT CCA CGA GGA TCC ATG GAT 1 fisn 

Thr Arg Gly Gly Gin Hi. Thr Thr Leu Val Pro Arg Gly Ser Me? £J 1650 
5U 55 60 

r^t ^ G ? T TTT ATC CAA AGC CTT AAA GAT GAT CCA AGC CAA AGT 

Gin Arg Asn Gly Phe He Gin Ser Leu Lys Asp Asp Pro sir Gin Kr 



1698 



1746 



1794 



SJJ J?! JJo Sa S* ^ T ^ TTC ^ AAA GAT CAA CAA AGC GCC 

Y A.a Asp Ala Gin Gin Asn Asn Phe Asn Lys Asp Gin Gin Ser Ala 

100 105 no 

III ^yr SJJ tTo T G m TG CCT ^ TTA AAC GAA GCG CAA CGT AAC 1842 

Phe Tyr Glu He Leu Asn Met Pro Asn Leu Asn Glu Ala Gin Arg Asn 



m ss is si js s: st ^ s ™ - s s £s ss »»• 
si m si s ^ si si ss si s si s s & s e »» 

loO ice 



tSs - - - - i - Si iS SS S £ Si iS 22 i£ »•« 



2034 



mI? o CT AAC TTA AAC GAA GAA CAA CGC AAT GGT TTC ATC CAA AGC TTA 
Met Pro Asn Leu Asn Glu Glu Gin Arg Asn Gly III ill Gin Ser ZZu 

B0 185 190 

^ ss as si sss si s m si ™ - «i s si ss *>« 
si is |^ s si si ss Si ss 2; sis & s S & si -° 
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CAA CAA AAT GCT TTC TAT GAA ATC TTA CAT TTA CCT AAC TTA AAC GAA 5170 

Gin Gin Asn Ala Phe Tyr Glu He Leu His Leu Pro Asn Leu Asn ctu" 8 
225 230 235 

GAA CAA CGC AAT GGT TTC ATC CAA AGC CTA AAA GAT GAC CCA AGC CAA 22 2 6 

Glu Gin Arg Asn Gly Phe He Gin Ser Leu Lys Asp Asp Pro Ser ^ 

■^u 245 250 



255 



W ?? ^ ? T ? TA G ? A GAA GCT ^ CTA GAT GCT CAA GCA 

Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala 

260 265 270 

CCA AAA GCT GAC AAC AAA TTC AAC AAA GAA CAA CAA AAT GCT TTC TAT 
Pro Lys Ala Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr 
275 280 285 



AACATTTGGC AATAAAGTTT CTTAAGATTG AATCCTGTTG CCGGTCTTGC GATGATTATC 
ATATAATTTC TGTTGAATTA CGTTAAGCAT GTAATAATTA ACATGTAATG CATGACGTTA 
TTTATGAGAT GGGTTTTTAT GATTAGAGTC CCGCAATTAT ACATTTAATA CGCGATAGAA 
AACAAAATAT AGCGCGCAAA C TAG GAT AAA TTATCGCGCG CGGTGTCATC TATGTTACTA 



GAT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Met Ala Asp Thr Ala Arg Gly Thr His His Asp He He Gly Arg Asp 

10 15 

Gin Tyr Pro Met Met Gly Arg Asp Arg Asp Gin Tyr Gin Met Ser Gly 
20 25 30 

Arg Gly Ser Asp Tyr Ser Lys Ser Arg Gin He Ala Lys Ala Ala Thr 

J3 40 45 

Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Ser Leu Thr Leu 



60 



Val Gly Thr Val He Ala Leu Thr Val Ala Thr Pro Leu Leu Val He 

70 75 80 

Phe Ser Pro He Leu Val Pro Ala Leu He Thr Val Ala Leu Leu He 



2274 



2322 



Su" nl Jl A S AT T A S CT ^ C TTA ACT GAA GAA CAA CGT AAC GGC TTC 2370 
Glu He Leu His Leu Pro Asn Leu Thr Glu Glu Gin Arg Asn Gly Phe 

295 300 

ATC CAA AGC CTT AAA GAC GAT CCG GGG AAT TCC CGG GGA TCC GTC GAC ?*ift 
He Gin Ser Leu Lys Asp Asp Pro Gly Asn Ser Arg Gly Ser Val Sp ^ 

JUD 310 315 

lit Sn iTe ?£ J£ ^ G ^"GCAT GCCTGCAGGT CGATCGTTCA 245 6 
320 325 



2526 
2586 
2646 
2706 
2709 
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85 



90 



95 



Thr Gly Phe Leu Ser Ser Gly Gly Phe Gly He Ala Ala He Thr Val 
100 105 HO 

Phe Ser Trp He Tyr 
115 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 

Tyr Ala Thr Gly Glu His Pro Gin Gly Ser Asp Lys Leu Asp Ser Ala 

Arg Met Lys Leu Gly Ser Lys Ala Gin Asp Leu Lys Asp Arg Ala Gin 
20 25 30 

Tyr Tyr Gly Gin Gin His Thr Gly Gly Glu His Asp Arg Asp Arg Thr 
Ji 40 45 

Arg Gly Gly Gin His Thr Thr Leu Val Pro Arg Gly Ser Met Asp Gin 
u 55 60 

Arg Asn Gly Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala 

70 75 80 

Asn Val Leu Gly Glu Ala Gin Lys Leu Asn Asp Ser Gin Ala Pro Lys 
85 90 95 

Ala Asp Ala Gin Gin Asn Asn Phe Asn Lys Asp Gin Gin Ser Ala Phe 
100 105 110 

Tyr Glu lie Leu Asn Met Pro Asn Leu Asn Glu Ala Gin Arg Asn Gly 
A=l 120 12 5 

Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Thr Asn Val Leu 

135 140 

Gly Glu Ala Lys Lys Leu Asn Glu Ser Gin Ala Pro Lys Ala Asp Asn 

150 155 160 

Asn Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu He Leu Asn Met 
165 170 175 

Pro Asn Leu Asn Glu Glu Gin Arg Asn Gly Phe He Gin Ser Leu Lys 
180 185 190 

Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ser Glu Ala Lys Lys Leu 

200 205 

Asn Glu Ser Gin Ala Pro Lys Ala Asp Asn Lys Phe Asn Lys Glu Gin 



220 



Gin Asn Ala Phe Tyr Glu He Leu His Leu Pro Asn Leu Asn Glu Glu 

230 235 24Q 

Gin Arg Asn Gly Phe He Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser 
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Ala Asn Leu Leu 
260 

Lys Ala Asp Asn 
275 

lie Leu His Leu 
290 

Gin • Ser Leu Lys 
305 



Gin lie Thr Asn 



245 

Ala Glu Ala Lys 

Lys Phe Asn Lys 
280 

Pro Asn Leu Thr 
295 

Asp Asp Pro Gly 
310 

325 



250 

Lys Leu Asn ■ Asp 
265 

Glu Gin Gin Asn 



Glu Glu Gin Arg 
300 

Asn Ser Arg Gly 
315 



255 

Ala Gin Ala Pro 
270 

Ala Phe Tyr Glu 
285 

Asn Gly Phe lie 



Ser Val Asp Leu 
320 
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WE CLAIM : 

!• A method for the separation of a target molecule from a 

sample comprising: 

1) contacting (i) oil bodies with (ii) a sample containing the 
target molecule to allow the target molecule to associate with the oil 
bodies; and 

2) separating the oil bodies associated with the target 
molecule from the sample. 



2. A method according to claim 1 further comprising a ligand 
10 molecule that associates with the oil bodies and the target molecule. 

3. A method according to claim 2 wherein the ligand molecule 
is covalently attached to the oil bodies. 

4. A method according to claim 3 wherein the ligand molecule 
is covalently attached to an oil body protein in the oil bodies. 

15 5 - A met *°d according to claim 2 wherein the ligand is 

comprised of two molecules, a first molecule that associates with the oil 
bodies and a second molecule that associates with the target, wherein the 
first molecule and the second molecule associate with each other. 

6. A method according to claim 5 wherein the first and second 

20 ligand molecules are conjugated to each other. 

7 ' A rnethod according to claim 4 wherein the ligand molecule 

is a protein. 

8- A method according to claim 7 wherein the protein ligand is 

a fusion protein with the oil body protein. 



WO 98/27115 



-67- 



PCT/CA97/00951 



10 



9 - A method according to any one of claims 1 to 8 wherein the 

target molecule is selected from the group consisting of proteins, 
peptides, organic molecules, lipids, carbohydrates, nucleic acids, cells, cell 
organelles, cell components, viruses, metals, metal ions and ions. 

10. A method according to claim 8 wherein the ligand molecule 
is hirudin and the target molecule is thrombin. 

11. A method according to claim 8 wherein the ligand molecule 
is protein A and the target molecule an immunoglobulin. 

12. A method according to claim 8 wherein the ligand molecule 
is metallothionein and the target molecule is cadmium. 

13. A method according to claim 8 wherein the ligand molecule 
is a cellulose binding protein and the target molecule is cellulose. 

14. A method according to claim 8 wherein the ligand molecule 
is a nucleic acid binding protein and the target molecule is a nucleic acid. 

15 15 ' A method according to claim 14 wherein the ligand is a single 

stranded DNA binding protein or an RNA binding protein and the target 
is a single stranded nucleic acid molecule. 

16. A method according to claim 2 wherein the ligand is an 

antibody that binds to the oil body or an oil body protein. 

20 17 ■ A method according to claim 16 wherein the target is a cell, 

cell organelle or cell component capable of binding the ligand antibody. 
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18 - A method according to claim 16 wherein the cell is 

Staphylococcus aureus. 

19. A method according to claim 16 wherein the ligand is a 

bivalent antibody that binds to both the oil body and the target. 

5 20. A method according to claim 6 wherein the ligand is an 

antibody conjugated to avidin and the target molecule is biotin. 

21. A method according to any one of claims 4, 7, 8, 9, 10, 11, 12, 

13, 14, 15 wherein the oil body protein is an oleosin. 



10 



15 



20 



22. A method according to claim 21 wherein the oleosin is 
derived from a plant selected from the group consisting of rapeseed 
(Brassica spp.), soybean (Glycine max), sunflower (Helianthus annuus), 
oil palm (Elaeis guineeis), coconut (Cocus nucifera), castor (Ricinus 
communis), safflower (Carthamus tinctorius), mustard (Brassica spp. and 
Sinapis alba), coriander (Coriandrum sativum) linseed /flax (Linum 
usitatissimum), thale cress (Arabidopsis thaliana) and maize (Zea mays). 

23. A method according to any one of claims 1 to 22 wherein in 
step 1) the oil bodies and. the sample are mixed and then incubated for 
about 1 minute to about 24 hours. 

24. A method according to claim 23 wherein the mixed oil bodies 
and sample are incubated at a temperature range from about 4°C to about 
room temperature. 

25. A method according to any one of claims 1 to 24 wherein the 
oil bodies associated with the target molecule are separated from the 
sample in step (2) by centrifugation, floatation or size exclusion. 



10 
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26. A method according to any one of claims 1 to 25, further 
comprising 3) separating the target molecule from the oil bodies. 

27. A method according to claim 26 wherein the target molecule 
is separated by elution under appropriate conditions. 

28. A method according to any one of claims 1 to 27 wherein the 
oil bodies are obtained from the group of plants consisting of rapeseed 
(Brassica spp.), soybean (Glycine max), sunflower (Helianthus annints), 
oil palm (Elaeis guineeis), coconut (Cocus nucifera), castor (Ricinus 
communis), safflower {Carthamus Hnctorius), mustard (Brassica spp. and 
Sinapis alba), coriander (Coriandrum sativum) linseed /flax (Linum 
usitatissimum), thale cress (Arabidopsis thaliana) and maize (Zea mays). 

29. A composition comprising oil bodies associated with a 
molecule. 

30. A composition according to claim 29, wherein the molecule is 
a target molecule selected from the group consisting of organic 
molecules, lipids, carbohydrates, nucleic acids, cells, cell organelles, cell 
components, viruses, metals, metal ions and ions. 

31. A composition according to claim 30 further comprising a 
ligand molecule that associates with the oil bodies and the target 

20 molecule. 

32. A composition according to claim 29, wherein the molecule is 
a ligand molecule. 



15 



33. A composition according to claim 31 or 32 wherein the ligand 

molecule is covalently attached to the oil bodies. 
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34. A composition according to claim 33 wherein the ligand is 

biotin. 

35 - An affinity matrix for use in separating a target molecule 

from a sample comprising an oil body that can associate with the target 
5 molecule. 



36. An affinity matrix for use in separating a target molecule 

from a sample comprising (a) an oil body and (b) a ligand molecule 
associated with the oil body, wherein the ligand molecule is capable of 
associating with the target molecule. 
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FIGURE 17 
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FIGURE 14 CONT'D 



1534 CAG GAT CTG AAA GAC AGA GCT CAG TAC TAC GGA CAG CAA CAT ACT GGT GGG GAA CAT GAC '593 

QDLKDRAQYYGQQHTGG e h d I 62 

1594 CGT GAC CGT ACT CGT GGT GGC CAG CAC ACT ACT CTC GTT CCA CGA GGA TCC ATG GAT CAA 1653 
RDRTRGGQHTT L V P „ B „ fl 

1654 CGC AAT GGT TTT ATC CAA AGC CTT AAA GAT GAT CCA AGC CAA AGT GCT AAC GTT TTA GGT 1713 

1B3 *»GriQSLKDDP SQS A NVLG 2Q2 

1714 GAA GCT CAA AAA CTT AAT GAC TCT CAA GCT CCA AAA GCT GAT GCG CAA CAA AAT AAC TTC 1773 
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1833 
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AGC 
5 


CAA 
0 


AGC 
5 


ACT 
T 


AAC 
<V 


GTT 


TTA 


1893 
262 


CCG 
P 


AAA 

K 


GCT 
A 


GAT 
D 


AAC 
N 


AAT 
N 


TTC 
F 


AAC 

N 


AAA 


1953 
282 


ATG 
H 


CCT 
P 


AAC 
N 


TTA 
£ 


AAC 


GAA 

£T 


GAA 


CAA 
C? 


CGC 
R 


2013 
302 


AGC 
S 


CAA 
Q 


AGT 
5 


GCT 
A 


AAC 
*V 


CTA 
L 


TTG 
L 


TCA 
5 


GAA 


2073 
322 


GCG 
A 


GAT 
D 


AAC 
AT 


AAA 

/r 


TTC 
F 


AAC 
W 


AAA 

JC 


GAA 
E 


CAA 
0 


2133 
342 


AAC 
N 


TTA 
L 


AAC 
N 


GAA 


GAA 


CAA 
0 


CGC 
i? 


AAT 
W 


GGT 
G 


2193 
362 


AGC 
5 


GCT 
A 


AAC 
N 


CTT 
L 


TTA 
L 


GCA 
A 


GAA 
E 


GCT 
A 


AAA 


2253 
382 


AAC 
N 


AAA 

K 


TTC 
F 


AAC 


AAA 
X 


GAA 


CAA 
0 


CAA 


AAT 

N 


2313 
402 



2314 GCT TTC TAT GAA ATT TTA CAT TTA CCT AAC TTA ACT GAA GAA CAA CGT AAC GGC TTC ATC 2373 
EILRLPNL ?ZEQ RNGFI 



422 



2374 CAA AGC CTT AAA GAC GAT CCG GGG AAT TCC CGG GGA TCC GTC GAC CTG CAG ATA ACA AAT 2433 

G SVD LQlTN 442 
2434 TAG «9« WWfl c.g^^ ^ 

"93 «»"*W««t«,«t W tce W ^ 

2673 caaattaccsrcgcgrc^tytcacecacgttactayat 
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FIGURE 14 



1 ccatggctatacccaacctcggtcttggrcacaccaggaactct.ct.ggxaagct,agctccact:ccccagaaacaaccggc 8 0 

Bl gccaaattgccggaattgctgacctgaagacggaacatcatcgtcgggtccttgggcgattgcggcggaagatgggtcag 160 

161 cttgggcttgaggacgagacccgaatcgagtctgttgaaaggttgttcattgggatttgtatacggagattggtcgtcga 240 

241 gaggtttgagggaaaggacaaatgggtttggctctggagaaagagagtgcggcttt.agagagagaattgagaggt.ttaga 32 0 

321 gagagatgcggcggcgatgacgggaggagagacgacgaggacctgcattatcaaagcagtgacgtggtgaaatttggaac 40C 

4 01 ttttaagaggcagatagatttattatttgtatccattttcttcattgrtctagaatgtcgcggaacaaattttaaaacta 4 80 

4 81 aatcctaaattttrctaattttgttgccaatagtggatatgtgggccgtatagaaggaatctattgaaggcccaaaccca 560 

561 tactgacgagcccaaaggttcgttttgcgttrtatgtttcggttcgatgccaacgccacattctgagctaggcaaaaaac 640 

641 aaacgtgtctttgaatagactcctct.cgttaacacatgcagcggctgcat.ggtgacgccattaacacgtggcctacaatt ' 720 

721 gcatgatgtctccattgacacgtgacttctcgtctcctttcttaatatatctaacaaacactcctacctcttccaaaata 800 

801 tatacacatctttttgatcaatctctcattcaaaatctcattctctctagtaaacaagaacaaaaaa ATG GCG GAT 8 76 
1 MAD 3 

877 ACA GCT AGA GGA ACC CAT CAC GAT ATC ATC GGC AGA GAC CAG TAC CCG ATG ATG GGC CGA 936 
4TARGTH HDIIGRDQYPMMG R 23 

937 GAC CGA GAC CAG TAC CAG ATG TCC GGA CGA GGA TCT GAC TAC TCC AAG TCT AGG CAG ATT 996 
24 D R D Q Y Q M S G RGSD YSKS R Q I 43 

997 GCT AAA GCT GCA ACT GCT GTC ACA GCT GGT GGT TCC CTC CTT GTT CTC TCC AGC CTT ACC 1056 
44AK *ATAVTAGGSLLVLSSLT « 

1057 CTT GTT GGA ACT GTC ATA GCT TTG ACT GTT GCA ACA CCT CTG CTC GTT ATC TTC AGC CCA 1116 
64LVG TVIALTVATPLLVIFSP 83 

1117 ATC CTT GTC CCG GCT CTC ATC ACA GTT GCA CTC CTC ATC ACC GGT TTT CTT TCC TCT GGA 1176 
84ILV *ALITVALLITGFLSSG 103 

1177. GGG TTT GGC ATT GCC GCT ATA ACC GTT TTC TCT TGG ATT TAC AA gtaagcacacatttatcatct 1241 
104GF GIAAITVFSWiy K 118 

1242 tactrcataattttgtgcaatatgtgcatgcatgtgttgagccagtagctttggatcaatttttttggtcgaataacaaa 1321 

1322 tgtaacaataagaaattgcaaar-ctagggaacatttggttaactaaatacgaaatttgacctagctagcttgaatgtgt 1401 

1402 ctgtgtatatcatctatataggtaaaatgcttggtatgatacctattgatrgtgaatag G TAC GCA ACG GGA 1473 

V A T G 122 

14 74 GAG CAC CCA CAG GGA TCA GAC AAG TTG GAC AGT GCA AGG ATG AAG TTG GGA AGC AAA GCT 1533 

123 EHpQGSDKLDSARMKLG'SKA 142 
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FIGURE 13 



NCOl 



1 
1 

62 



CTCC ATG CAT CAA CTC ftflT HIT TTT {ijr CAA AGC CTT AAA GAT GAT CCA AGC CAA AG7 GCT 6< 
DQRNGFI QSLKDDPSQSA 1 9 



20 r r r r r r r r r r r r r r P CCA r r r r r 5; 1 
\" r r r r r r r r r r r r r r r r° r r r r , i , ti 

\ 8 o r r r r r r r r r r r r r r r r r r r r 241 



2 8o r r r r r r gct *** *** ™ gaa tct caa gc * c « ^ aac 

^sqapkadn 

302 AAT TTC AAC AAA GAA CAA CAA AAT GCT TTC TAT GAA ATC TTC AAT ATG CCT AAC TTA AAC 

in e gaa r r r r g ggt r c r r r r A r r r r r r r gct ^ 
;« r r s tca r r r r r r r tct ^ gca ccg ^ gcg ~* - 

UAPK A-DNKF 159 

m r r r r ™ r r r r r r r* cat tta cct ^ tta g - 



301 
99 

361 
119 

421 
139 



541 

L N E E 179 



in t r r r r r r r r r r gac cca agc caa agc gct ™ ^ «* - 

IQSLKDD p SQSANLL 199 
602 GCA GAA GCT AAA AAG CTA AAT GAT GCT CAA GCA CCA AAA GCT r.r . 

200 A E A K K T m GCT GAC AAA TTC AAC AAA 

^APKADNKFNK 

662 GAA CAA CAA AAT GCT TTC TAT GAA ATT TTA CAT TTA CCT AAC TTA ACT GAA GAA CAA CCT 

h^pnlteeqr 

722 AAC GGC TTC ATC CAA AGC CTT AAA GAC GAT CCG GGG AAT TCC CGG Gr* >rnr ^ 
240 N G F I Q S l K D D P G « I GT ° CT<t 781 

^ GNSRG SVDL 259 

782 CAG ATA fl r A Afl T TAG AAGCTTGC 

260 Q I T H + HindHI 8 °4 

264 



661 
K 219 

721 
239 
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FIGURE 12 
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FIGURE 11 
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FIGURE 9 
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17 *nl TGC *** GAG TGC *** TGC ACC TCC TGC AAA AGC TGC 7GC 1772 

Z08CAGSCKCKECKCTSCKKSCC 227 

1773 TCC TGC TGT CCT GTG GGC TCT GCC AAG TGT GCC CAG GGC TGC ATC TGC AAA GGG GCG TCG 1832 
28SCCPVGCAK CAQGCICKGAS 247 

'III T° C GC f C C CC C GT T C 9C " CC 9 C ""9 C ^=-g«at g a gt t==aa g a t „ t t tg . ga c 9 a ag 1904 

255 

1905 tt.gt:tggt t gt«tt«tg,aaettt«ttt««9Cttgt««tg tg g M «g..cgt 9 t g ge« et ^ W tttttaaa t gttgg 1984 
1985 ^"taaagatg«r=cr t tgga tt aacragta ttt «cc t attgg t t t ca t gg«t t agcac a caa C att.t t aaa t a t g 2 064 
2065 "gttagatgatargctgcctgctttattatttacttacccctcaccttcagtrtcaaagttgttgcaatgactctgtgt 2144 
2145 -gtttaagarcgagtgaaagtagatrttgtctatatttattaggggtatttgatatgctaatggtaaacatggtttacga 2224 
2225 "9C9tac t t t tt t ggttatggrgttgacgttt:cc tt «aaaca tt atag t agcgt:cct t gg t = t g t g tt car t g gtt g a 23 04 
2305 ■caaaggeaeactcacttggagatgcegtctccactgatatttgaaeaaagaattcggtace 2 366 
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FIGURE 8 

1 ^gctcaaatacgatctgatactgataacgtctagatttttagggttaaagcaatcaatcacctgacgat-.caaggtggt 80 
81 tg^catgacgattccagaaaacatcaagcaagctctcaaagctacactctttgggatcatactgaactctaacaacct 160 
161 c 9ttatgrcccgtag t gccag t acagaca t cctcgtaactcggatta t gcacgatgccatggc t a t acccaacctcggrc 240 
241 "WWacaccaggaactctctggtaagctagcrccactccccagaaacaaccggcgccaaat.gccggaattgctgacc 320 
321 ^"^acggaacatcatcgtcgggtccttgggcgattgcggcggaagatgggtcagcttgggcttgaggacgagacccga 400 
401 "cgagtctgttgaaaggttg^a^gggatttgratacggagattggtcgtcgagaggtttgagggaaaggacaaatg 480 
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